repoGitHubTrust 82 · PrimaryPublished yesterdayLive · 19h ago
benseverndev-oss/goldenmatch
Zero-config entity resolution & record linkage. The zero-tuning Fellegi-Sunter path beats hand-tuned Splink head-to-head and scales from a CSV to a verified 100M-row dedupe in 9.2 min. Fuzzy/exact/probabilistic + PPRL + LLM + identity graph. Python + edge-safe TypeScript (WASM), SQL-native in Postgres & DuckDB, MCP/REST + dbt/Airflow.
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
