Find the reference list
Locates bibliography headings, numbered lists, and reference blocks in the PDF.
PDF reference extraction
Upload a paper, manuscript, CV, or research report saved as a PDF. LumaCite finds the reference list, extracts citation fields, helps you review the records, and prepares clean exports.
Choose a paper, manuscript, CV, or research report saved as a PDF.
Waiting for a PDF.
Upload to review
LumaCite turns a document reference list into records you can inspect, correct, and export.
Locates bibliography headings, numbered lists, and reference blocks in the PDF.
Turns individual citations into reviewable rows and separates unrelated text.
Finds titles, authors, years, journals, DOIs, PMIDs, PMCIDs, arXiv IDs, and URLs.
Opens each record beside its highlighted citation text on the source PDF page.
Checks missing fields, conflicts, and duplicates against scholarly sources.
Downloads records in formats compatible with most reference managers.
Research workflows
Turn a PDF bibliography into records you can verify, complete, style, and export.
Frequently asked questions
Practical answers about file requirements, processing, review, metadata, and reference-manager exports.
Text-based PDFs with selectable text and a recognizable reference list work best because LumaCite can read and separate each reference more reliably. Papers, manuscripts, CVs, and reports are supported when saved as PDFs.
The standard extraction endpoint accepts PDFs up to 60 MB. If a file reaches the standard route's size or time limit, LumaCite attempts background processing. Platform upload limits can still apply to very large files.
The extraction service uses a temporary server-side PDF file and deletes that file when processing finishes, including when processing fails. Download or export the citation records you want to keep.
Depending on the identifier and record type, LumaCite may check Crossref, PubMed, Europe PMC, DataCite, OpenAlex, Semantic Scholar, arXiv, Open Library, Google Books, and Unpaywall.
Scanned or image-based PDFs may need OCR before extraction. LumaCite can identify some scanned files and attempt OCR processing, but unclear scans, low-resolution pages, and poor text layers should be reviewed carefully.
Review records with missing fields, no strong identifier, conflicting metadata, unclear source text, possible duplicates, or a Check details status. Before downloading, compare the source text, citation fields, identifiers, and metadata evidence for the records you plan to export.
Yes. Use RIS for broad reference-manager compatibility, EndNote XML for EndNote, or BibTeX and CSL-JSON for compatible tools and writing workflows.
Use the Text Citation Extractor for DOCX, TXT, RTF, RIS, BibTeX, CSV, Markdown, or bibliography text copied from another document.
Learn more
Choose a count to filter the reference list. Selected references are included in export.
Upload a PDF to inspect its source pages here.
The selected citation will be highlighted on its original PDF page.
Upload a PDF to review extraction quality.