paperarXivTrust 82 · PrimaryPublished 3d agoLive · 2d ago

Explicit Fuzzy Logic in the Feed-Forward Layer: Self-Forgetting Quantifiers Discover Legible Grammatical-Licensing Detectors

A transformer's feed-forward (FFN) sublayer materializes the distinctions attention gathers, yet gives no account of what it computes. In a parameter-neutral replacement, each hidden unit is an explicit fuzzy set operation on sigmoid-bounded [0,1] memberships: intersection A*B and set-difference A*(1-B), the latter a bounded positive negation ("A but not B") that gated/bilinear units lack -- a negation-capable FFN (NC-FFN). On N-bit parity they are the most parameter-efficient reasoning basis at shallow depth; at scale (125M, OpenWebText) NC-FFN ties the GELU baseline's perplexity, every unit

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Related to

glossary_termTransformer

Implements (incoming)

reposonos/tract

Related across the graph

glossary_termTransformer reposonos/tract

Topics

cs.CL