Skip to main content
Back to News
Perplexity AI launched DRACO, an open-source benchmark evaluating research agents via 100 tasks from...
Technology
1 min read
US

Perplexity AI launched DRACO, an open-source benchmark evaluating research agents via 100 tasks from...

The AMW Read

Perplexity (a key player in research/search) is updating the agentic evaluation landscape with a production-grounded benchmark, but this is an incremental tool release rather than a structural shift.
NoveltySignificance
AI Agents · Player Map

Perplexity AI launched DRACO, an open-source benchmark evaluating research agents via 100 tasks from real user queries. Spanning 10 domains, Perplexity leads with 89.4 percent accuracy in Law and 82.4 percent in Academic research. Shifting from synthetic puzzles to production-grounded data creates a rigorous standard for multi-step reasoning. This systemic evolution forces the AI industry to prioritize factual depth over conversational fluency. 🚀

#AIResearch #DRACO #PerplexityAI #LLM #Technology

How This Connects

Based on AI Agents · Player Map

  1. 2d agoOpenAI launches cloud-based workspace agents for Business, Enterprise, Edu, and Teachers plans.OpenAI
  2. 2d agoSalesforce launches Agentforce Vibes 2.0 to solve AI agent context overload.Salesforce
  3. 1w agoThe $10 Billion Startup Training AI to Replace the White-Collar WorkforceMercor
  4. 1w agoMeta paid $2 billion for AI startup Manus, a Singapore‑based firm with Chinese roots. The deal trigg...Manus
  5. 1mo ago14.ai just announced they're replacing entire customer support teams at startups with AI, backed by...14.ai
  6. 2mo agoPerplexity AI launched DRACO, an open-source benchmark evaluating research agents via 100 tasks from... · THIS ARTICLE

Related News

More news from Perplexity

Stay updated with the latest news and announcements from Perplexity.

View all Perplexity news

Discover AI Startups

Explore 2,000+ AI companies with VC-grade analysis, funding data, and investment insights.

Explore Dashboard