Startups & Business News

TreeQuest by Sakana AI: Unlocking the Power of Multi-Model AI Collaboration

Discover how Sakana AI’s TreeQuest enables multi-model AI collaboration, boosting problem-solving power and accuracy. Explore the future of AI teamwork.

The world of artificial intelligence is seeing a major shift with the arrival of TreeQuest from Sakana AI. This open-source framework is designed to let multiple large language models (LLMs) work together on complex tasks, creating a powerful AI ensemble that outperforms any single model.

What Sets TreeQuest Apart?

At the core of TreeQuest is the Adaptive Branching Monte Carlo Tree Search (AB-MCTS) algorithm. Originally inspired by strategies used in game AI, AB-MCTS allows TreeQuest to balance between searching deeper (refining promising solutions) and searching wider (exploring new alternatives). What’s unique is TreeQuest’s ability to dynamically assign the best-suited LLM to each step of a task. Over time, it learns which models excel at specific subtasks and adapts its strategy accordingly.

This approach is different from traditional Mixture of Experts (MoE), which operates within a single model. TreeQuest orchestrates independently trained models—such as GPT, Gemini, and DeepSeek—in real time, making it modular, flexible, and adaptable for a wide range of applications.

Results of AB-MCTS and Multi-LLM AB-MCTS on ARC-AGI-2, showing Pass@k as a function of the number of LLM calls.
Results of AB-MCTS and Multi-LLM AB-MCTS on ARC-AGI-2, showing Pass@k as a function of the number of LLM calls.

Why Multi-Model Teams Are More Effective

Every LLM has its unique strengths and weaknesses. For example, one model might be great at logical reasoning but struggle with creative writing, while another is better at generating code but less accurate with facts. TreeQuest leverages this diversity by assigning the right model to the right part of a problem, much like assembling a team of specialists.

In tests on the ARC-AGI-2 benchmark, TreeQuest’s ensemble of o4-mini, Gemini 2.5 Pro, and DeepSeek-R1 solved over 30% of challenging visual reasoning problems—a significant improvement over what any individual model achieved alone. The system even demonstrated iterative error correction, where one model’s output was improved upon by others, leading to solutions that no single model could reach.

Technical Architecture and Real-World Use Cases

TreeQuest’s architecture includes:

  • Orchestration Engine: Coordinates the decision tree and routes tasks based on current needs.

  • Agent Capability Modeling: Continuously assesses each model’s performance and adjusts routing

  • Parallel Execution Framework: Explores multiple solution paths at once for better results

  • Adaptive Learning System: Learns from every interaction to improve future decisions

The open-source framework is already showing promise in areas like algorithmic codingmachine learning optimization, and software performance tuning. It’s also being explored as a way to reduce hallucinations by assigning fact-sensitive tasks to more grounded models and creative tasks to others, achieving a balance between accuracy and fluency.

TreeQuest represents a new era in AI, where collaboration, adaptability, and diversity are the keys to tackling complex problems that single models can’t solve alone.

futureTEKnow covers technology, startups, and business news, highlighting trends and updates across AI, Immersive Tech, Space, and robotics.

futureTEKnow

Editorial Team

futureTEKnow is a leading source for Technology, Startups, and Business News, spotlighting the most innovative companies and breakthrough trends in emerging tech sectors like Artificial Intelligence (AI), Robotics, and the Space Industry.

Discover the companies and startups shaping tomorrow — explore the future of technology today.

Join Our Newsletter

* indicates required

Intuit Mailchimp

Trending Companies

Latest Articles

futureTEKnow is focused on identifying and promoting creators, disruptors and innovators, and serving as a vital resource for those interested in the latest advancements in technology.

© 2026 All Rights Reserved.