repoGitHubTrust 82 Β· PrimaryPublished 19h agoLive Β· 19h ago
Giskard-AI/giskard-oss
π’ Open-Source Evaluation & Testing library for LLM Agents
Lineage graph
Paper β model β repo connections mined from source citations (Tier-1 exact match).
Implements
Implements (incoming)
Related across the graph
paperTraceLab: Characterizing Coding Agent Workloads for LLM ServingpaperAgenticSTS: A Bounded-Memory Testbed for Long-Horizon LLM AgentspaperPACE: A Proxy for Agentic Capability EvaluationpaperTestEvo-Bench: An Executable and Live Benchmark for Test and Code Co-EvolutionpaperA$^{2}$utoLPBench: An Auto-Generated, Agent-Friendly LP Benchmark via Inverse-KKT Construction
