Archēglyph

Citation extraction

Finding and structuring citations — footnote markers, bibliographic entries, in-line references — so they become queryable objects.

Last updated

Finding and structuring the citations — footnote markers, bibliographic entries, in-line references — within a document so they become queryable objects rather than prose.

Why it matters for your research. Once citations are extracted, the corpus becomes a graph: works that cite each other, authors who cluster by whom they cite, gaps in a citation chain that signal a missing source. For academic-historical corpora this is one of the highest-value analyses available.

In Archēglyph. In the Beyond section of the roadmap; non-trivial because citation formats vary wildly across time and field.

Not to be confused with. NER finds people and places; citation extraction finds the references — often overlapping entities, but structured as pointers into the citation graph.

Related terms

References

← Back to the glossary