Read original ↗
paperarXivTrust 82 · PrimaryPublished 4d agoLive · 3d ago

Research Entity Extraction and Topic Detection from UKRI Grant Proposals

This paper presents preliminary findings from a UKRI-funded Metascience project comparing three LLM-based approaches, GPT-4o, Mistral, and a bespoke algorithm, DSIT-Taxonomies, for extracting and classifying research entities from funding proposals. Our project "Tracking Stars and Unicorns" aims to identify early signals of emerging research areas to inform public investment. Our methodology employed a three-stage pipeline, leveraging Mistral for primary entity extraction and mapping against the OpenAlex Topics taxonomy. We evaluated our approach across 42 proposals' abstracts from different a

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Topics