view article Article Phare LLM benchmark V2: Reasoning models don't guarantee better security 8 days ago • 9
view article Article Auto-Optimize Pydantic Models for Structured Information Extraction: A Complete Guide to DSPydantic 15 days ago
view article Article Risk assessment for LLMs and AI agents: OWASP, MITRE Atlas, and NIST AI RMF explained 16 days ago
view article Article RealPerformance, A Dataset of Language Model Business Compliance Issues Jul 21 • 4
view article Article LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs Jul 2 • 16
view article Article Measuring What Matters: Objective Metrics for Image Generation Assessment May 20 • 10
view article Article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs May 7 • 42
view article Article 🔥 Announcing FLUX-Juiced: The Fastest Image Generation Endpoint (2.6 times faster)! Apr 23 • 12
view article Article Private Synthetic Data Generation Made Easy: Out-of-the-Box with Docker, Argilla & Ollama Mar 5 • 4
view article Article Agentic RAG Stack (2/5) - Augment retrieval results by reranking using Sentence Transformers Feb 5 • 10
view article Article Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB Jan 27 • 21
view article Article Fine-tune ModernBERT for text classification using synthetic data Dec 30, 2024 • 39
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language +4 Dec 16, 2024 • 152
view article Article Open Preference Dataset for Text-to-Image Generation by the 🤗 Community +5 Dec 9, 2024 • 69