Perplexity AI launched DRACO, an open-source benchmark evaluating research agents via 100 tasks from...

The AMW Read

Perplexity (a key player in research/search) is updating the agentic evaluation landscape with a production-grounded benchmark, but this is an incremental tool release rather than a structural shift.

NoveltySignificance

AI Agents · Player Map

Perplexity

AI Agents

View Company Profile

Perplexity AI launched DRACO, an open-source benchmark evaluating research agents via 100 tasks from real user queries. Spanning 10 domains, Perplexity leads with 89.4 percent accuracy in Law and 82.4 percent in Academic research. Shifting from synthetic puzzles to production-grounded data creates a rigorous standard for multi-step reasoning. This systemic evolution forces the AI industry to prioritize factual depth over conversational fluency. 🚀

#AIResearch #DRACO #PerplexityAI #LLM #Technology

Explore Perplexity Read Original

How This Connects

Based on AI Agents · Player Map

2d agoOpenAI launches cloud-based workspace agents for Business, Enterprise, Edu, and Teachers plans.OpenAI
2d agoSalesforce launches Agentforce Vibes 2.0 to solve AI agent context overload.Salesforce
1w agoThe $10 Billion Startup Training AI to Replace the White-Collar WorkforceMercor
1w agoMeta paid $2 billion for AI startup Manus, a Singapore‑based firm with Chinese roots. The deal trigg...Manus
1mo ago14.ai just announced they're replacing entire customer support teams at startups with AI, backed by...14.ai
2mo agoPerplexity AI launched DRACO, an open-source benchmark evaluating research agents via 100 tasks from... · THIS ARTICLE

Perplexity AI launched DRACO, an open-source benchmark evaluating research agents via 100 tasks from...

The AMW Read

#AIResearch #DRACO #PerplexityAI #LLM #Technology

How This Connects

Related News

Perplexity launches 'Personal Computer' AI agent for Mac, exclusively for $200/month Max subscribers...

Perplexity has launched the $1 M ‘Billion Dollar Build’ challenge, giving US founders up to $1 M see...

Perplexity AI CEO Aravind Srinivas credits Elon Musk's relentless mindset for building a $9B rival t...

Perplexity launched Computer, a $200/month platform orchestrating 19 AI models through 400+ integrat...

Perplexity just launched Computer, orchestrating 19 AI models across OpenAI, Anthropic, and Google t...

More news from Perplexity

Discover AI Startups

Perplexity AI launched DRACO, an open-source benchmark evaluating research agents via 100 tasks from...

#AIResearch #DRACO #PerplexityAI #LLM #Technology

Related News

Perplexity launches 'Personal Computer' AI agent for Mac, exclusively for $200/month Max subscribers...

Perplexity has launched the $1 M ‘Billion Dollar Build’ challenge, giving US founders up to $1 M see...

Perplexity AI CEO Aravind Srinivas credits Elon Musk's relentless mindset for building a $9B rival t...

Perplexity launched Computer, a $200/month platform orchestrating 19 AI models through 400+ integrat...

Perplexity just launched Computer, orchestrating 19 AI models across OpenAI, Anthropic, and Google t...

More news from Perplexity

Discover AI Startups

Perplexity has launched the $1 M ‘Billion Dollar Build’ challenge, giving US founders up to $1 M see...