1. School of Information Science and Technology, Yunnan Normal University, Kunming, China
2. AI Products Department, China National Knowledge Infrastructure, Beijing, China
Email: 4176@ynnu.edu.cn (J.C.); yrc16401@cnki.net (R.Y.)
*Corresponding author
Manuscript received November 14, 2025; accepted December 1, 2025; published June 30, 2026
Abstract—Large Language Models (LLMs) have demonstrated remarkable advances in natural language processing and content generation, yet they remain limited in supporting complex research tasks due to hallucinations, shallow summarization, weak multi-step reasoning, and unverifiable outputs. To address these challenges, the concept of DeepResearch has recently emerged, referring to research-oriented agents built on longcontext LLMs and augmented with capabilities such as multistep reasoning, deep retrieval, grounding and citation, agentic orchestration, and structured generation. These mechanisms enable models to act as autonomous researchers, capable of planning, retrieving, and producing systematic, evidence-based reports. Representative systems include Google Gemini Deep Research and OpenAI Deep Research, while in China platforms such as Kimi, Baidu Wenxin, Doubao, and DeepSeek are actively developing localized solutions. This paper provides a taxonomy of core techniques, presents a layered system architecture and architectural paradigms, reviews representative implementations and applications, and highlights open challenges and future directions. By consolidating current progress, we aim to guide the development of reliable and trustworthy DeepResearch agents.
Keywords— DeepResearch; Large Language Models; Research Agents; Retrieval-Augmented Generation; Agentic
Orchestration
Cite: Jinyan Cai and Ruochong Yao, "DeepResearch: A Survey of LLM-based Research Agents," International Journal of Engineering and Technology, vol. 18, no. 2, pp. 92-101, 2026.
Copyright © 2026 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (
CC BY 4.0).