repoGitHubTrust 82 · PrimaryPublished 20h agoLive · 18h ago
thu-pacman/chitu
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Implements
paperRaBitQCache: Rotated Binary Quantization for KVCache in Long Context LLM InferencepaperUnderstanding Large Language ModelspaperWhen are likely answers right? On Sequence Probability and Correctness in LLMspaperMessage Passing Enables Efficient ReasoningpaperCross-lingual Relation Extraction with Large Language Models: Zero-Shot, Few-Shot, and Fine-Tuned Evaluation on Romanian
Implements (incoming)
paperThe Grammar Does the Work: Functional vs. Lexical Dependency Length Minimization Across Universal DependenciespaperNAVER LABS Europe Submission to the Instruction-following 2026 Short TrackpaperBayesian Sparse Low-Rank Adaptation for Large Language Model Uncertainty EstimationpaperUnlocking Speech-Text Compositional Powers: Instruction-Following Speech Language Models without Instruction TuningpaperBamiBERT: A New BERT-based Language Model for VietnamesepaperOn the Role of Directionality in Structural GeneralizationpaperAn Efficient vLLM-Based Inference Pipeline for Unified Audio Understanding and GenerationpaperReContext: Recursive Evidence Replay as LLM Harness for Long-Context Reasoning
Related across the graph
paperOn the Role of Directionality in Structural GeneralizationpaperReContext: Recursive Evidence Replay as LLM Harness for Long-Context ReasoningpaperAn Efficient vLLM-Based Inference Pipeline for Unified Audio Understanding and GenerationpaperWhen are likely answers right? On Sequence Probability and Correctness in LLMspaperCross-lingual Relation Extraction with Large Language Models: Zero-Shot, Few-Shot, and Fine-Tuned Evaluation on RomanianpaperMessage Passing Enables Efficient ReasoningpaperBayesian Sparse Low-Rank Adaptation for Large Language Model Uncertainty EstimationpaperUnlocking Speech-Text Compositional Powers: Instruction-Following Speech Language Models without Instruction TuningpaperThe Grammar Does the Work: Functional vs. Lexical Dependency Length Minimization Across Universal DependenciespaperUnderstanding Large Language ModelspaperRaBitQCache: Rotated Binary Quantization for KVCache in Long Context LLM InferencepaperBamiBERT: A New BERT-based Language Model for VietnamesepaperNAVER LABS Europe Submission to the Instruction-following 2026 Short Track
