AI & ML interests
LLMs
Recent Activity
KlusAI
Where AI research meets real-world impact
π What We're About
KlusAI bridges the gap between cutting-edge AI research and production systems. We publish our datasets and models openly to advance the field β 9M+ synthetic training examples and counting.
Research Themes:
- 𧬠Synthetic Data Generation β Large-scale training data without privacy concerns
- β‘ Efficient AI Systems β Models that run on consumer hardware
- π Multilingual NLP β With deep Romanian language expertise
π Featured Publication
Synthetic Data Generation Using Large Language Models
Advances in Text and Code β IEEE Access, 2025
Our comprehensive survey on generating training data using LLMs. How enterprises can generate training data at scale β reducing annotation costs, addressing data scarcity, and enabling fine-tuning without exposing sensitive data.
π Read on IEEE Xplore Β· π arXiv Preprint
π¬ Flagship Project: TinyFabulist
TinyFabulist is our open research programme on large-scale synthetic narrative generation. We demonstrate that small, efficient models can produce high-quality training data at scale.
| Release | Description | Size |
|---|---|---|
| TinyFabulist v1 | Synthetic English Fables | ~3M examples |
| Upcoming | Multilingual extensions, evaluation benchmarks | β |
Key principles:
- π Scale β 9M+ synthetic training examples generated
- π§ Efficiency β All content produced with β€8B parameter models
- π Openness β Generation scripts, pipelines, and methodology shared publicly
π Paper (arXiv) Β· π» Code (GitHub)
π¦ What You'll Find Here
- Datasets β Large-scale synthetic training corpora for fine-tuning and research
- Models β Efficient, instruction-tuned models optimized for specific tasks
- Evaluation β Benchmarks and tooling for synthetic data quality assessment
π€ Work With Us
Beyond open research, we offer enterprise AI services:
| Service | Description |
|---|---|
| AI Strategy | Define your AI roadmap and implementation plan |
| Custom Development | Bespoke AI solutions tailored to your domain |
| Model Training | Fine-tuning and deploying models for your use case |
| MLOps & Infrastructure | Scalable pipelines and production deployment |
Need custom synthetic data or domain-specific models? We partner with organizations on applied research challenges.
π« Get in Touch
| Purpose | Contact |
|---|---|
| Research collaboration | research@klusai.com |
| Enterprise services | services@klusai.com |
| General inquiries | hello@klusai.com |
Technical questions? Open an issue on the relevant dataset or model repository.
Applied Research Β· AI Services Β· Ventures
klusai.com Β· GitHub Β· X