Dataset
16 items across the graph — tagged with Dataset.
From the graph · 16
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, and more.
CSGHub is a brand-new open-source platform for managing LLMs, developed by the OpenCSG team. It offers both open-source and on-premise/SaaS solutions, with feat…
Hugging Face model with 3918 likes. Tags: diffusers, stable-diffusion, text-to-image, dataset:Nerfgun3/bad_prompt, license:creativeml-openrail-m, endpoints_comp…
The platform for LLM evaluations and AI agent testing
Hugging Face model with 2966 likes. Tags: transformers, pytorch, safetensors, gpt_bigcode, text-generation, code, dataset:bigcode/the-stack-dedup, arxiv:1911.02…
The Self-Coding System for Your App — Alan AI SDK for Web
An open source DevOps tool from the CNCF for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI Artifact.
csghub-server is the backend server for CSGHub which helps user to manage datasets, modes, and also run Model Inference, Finetune and Application Spaces.
Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers d…
AiTLAS implements state-of-the-art AI methods for exploratory and predictive analysis of satellite images.
A large collection of Khmer language resources. Khmer is a language used by Cambodia.
WildlifeDatasets: An open-source toolkit for animal re-identification
Procedural data generators suite for synthetic pretraining and formal reasoning
Testing Theory of Mind (ToM) in language models with epistemic logic
SexEst is an open-source Streamlit web application for predicting biological sex from skeletal measurements using machine learning (XGBoost, LightGBM, Linear Di…
