🚀 Invitation to Experience: China's First AI IDE Intelligent Programming Software Trae Chinese version downloadThe DeepSeek-R1 and Doubao-pro are available for unlimited use!

AI knowledge

sticky (of an Internet forum thread etc)

Application Evaluation of the Use of Inference Models in Modular RAG Systems

In this paper, we present a summary report of Kapa.ai's recent exploration of OpenAI's o3-mini and other inference models in the Retrieval-Augmented Generation (RAG) system. Kapa.ai is an AI assistant powered by a large-scale language model (LLM) that...

2025-03-02AI knowledge

Evaluating creativity in large language models: beyond the multiple-choice LoTbench paradigm

In the field of Large Language Modeling (LLM) research, the model's Leap-of-Thought ability, i.e., creativity, is no less important than the logical reasoning ability represented by Chain-of-Thought. However, there is still a relative lack of in-depth discussions and valid assessment methods for LLM creativity, which is a ...

2025-04-20

Getting to grips with Claude Code: a practical guide to boosting AI programming productivity

Mastering Claude Code: Hands-on Agentic Coding Tips from the Front Lines Claude Code is a command-line tool for Agentic Coding. Agentic coding is the process of giving an AI a degree of autonomy to understand tasks, plan steps, and perform operations (such as reading and writing...

2025-04-20

Trae Chinese Version First Invitation to Download: Unlimited use of DeepSeek-R1 after registration!

Enable Builder Smart Programming Mode, unlimited use of DeepSeek-R1 and DeepSeek-V3, smoother experience than the overseas version. Just enter the Chinese commands, even a novice programmer can write his own apps with zero threshold.

2025-04-29

GPT-4.1 Official Tips Engineering Guide (Chinese version)

The GPT-4.1 family of models offers significant improvements in coding, instruction adherence, and long context processing capabilities over GPT-4o. Specifically, it performs better on code generation and repair tasks, understands and executes complex instructions more accurately, and can efficiently handle longer input text. This hinted work ...

2025-04-17

The GTR framework: a new approach to cross-table Q&A based on heterogeneous graphs and hierarchical retrieval

1. INTRODUCTION In today's information explosion, a large amount of knowledge is stored in the form of tables in web pages, Wikipedia and relational databases. However, traditional question and answer systems often struggle to handle complex queries across multiple tables, which has become a major challenge in the field of artificial intelligence. To address this challenge, researchers ...

2025-04-07

How EQ-Bench Assesses Emotional Intelligence and Creativity in Large Language Models

As the capabilities of large-scale language models (LLMs) evolve at a rapid pace, traditional benchmark tests, such as MMLU, are gradually showing limitations in distinguishing top models. Relying on knowledge quizzes or standardized tests alone, it has become difficult to comprehensively measure the nuanced capabilities of models that are critical in real-world interactions, such as emotional intelligence, creative...

2025-04-01

Reasoning with Large Language Models: Balancing "Underthinking" and "Overthinking"

The development of large language models (LLMs) is rapidly changing, and their reasoning ability has become a key indicator of their intelligence level. In particular, models with long reasoning capabilities, such as OpenAI's o1, DeepSeek-R1, QwQ-32B, and Kimi K1.5, which simulate the human deep thinking process by solving compound...

2025-03-31

突破工具调用瓶颈：CoTools 框架助力大型语言模型高效利用海量工具-首席AI分享圈

Breaking the Tool Calling Bottleneck: The CoTools Framework Enables Large Language Models to Efficiently Utilize a Massive Number of Tools

INTRODUCTION In recent years, Large Language Models (LLMs) have made impressive progress in the field of Artificial Intelligence, and their powerful language comprehension and generation capabilities have led to a wide range of applications in several domains. However, LLMs still face many challenges when dealing with complex tasks that require the invocation of external tools. For example, ...

2025-03-29

uv common commands

The Python ecosystem has always had a shortage of package management and environment management tools, from the classic pip and virtualenv to pip-tools and conda to the modern Poetry and PDM. Each tool has its area of specialization, but they often make a developer's toolchain fragmented and complex. Now, from A...

2025-03-29

Why are multi-intelligence collaborative systems more prone to error?

INTRODUCTION In recent years, multi-intelligent systems (MAS) have attracted much attention in the field of artificial intelligence. These systems attempt to solve complex, multi-step tasks through the collaboration of multiple Large Language Model (LLM) intelligences. However, despite the high expectations of MAS, their performance in real-world applications has not been ...

2025-03-29

Anthropic 深度剖析 Claude：揭示大型语言模型的的决策与推理过程-首席AI分享圈

Anthropic Deep Dive Claude: Revealing Decision Making and Reasoning Processes in Large Language Models

Large Language Models (LLMs) like Claude are not created by humans writing program code; they are trained on massive amounts of data. In the process, the models learn their own strategies for solving problems. These strategies are hidden in the billions of computations the model performs to generate each word for...

2025-03-28

Making AI Stop and Think: How Anthropic's "Think" Tool Enhances Claude Reasoning

Recently, Anthropic has introduced a new tool called "think", which aims to enhance the capability of Claude model in complex problem solving. In this paper, we will discuss the design concept, performance and best practices of the "think" tool, and analyze its implications for the future development of AI systems...

2025-03-24

DeepRetrieval: efficient information retrieval query generation driven by reinforcement learning

Abstract Information retrieval systems are critical for efficient access to large document collections. Recent approaches utilize Large Language Models (LLMs) to improve retrieval performance through query augmentation, but typically rely on expensive supervised learning or distillation techniques that require significant computational resources and manually labeled data. In ...

2025-03-19

OpenAI Releases: How Large Language Models Monitor Their Own Misbehavior

Large reasoning models exploit vulnerabilities when given the opportunity. Research has shown that these exploits can be detected by using large language models (LLMs) to monitor their chains-of-thought (CoT). Punishing models for "bad thoughts" does not prevent most misbehavior, but rather allows them to hide their intentions. ...

2025-03-18

[转载]QwQ-32B 的工具调用能力及 Agentic RAG 应用-首席AI分享圈

[Reprint] QwQ-32B's Tool Calling Capability and Agentic RAG Application

Background Recently, a paper entitled Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning (arxiv.org/pdf/2503.09516) has attracted much attention. The paper proposes a way to use reinforcement learning to train large language...

2025-03-17

LazyGraphRAG：大幅优化 GraphRAG 的质量与成本-首席AI分享圈

LazyGraphRAG: Dramatically Optimizing the Quality and Cost of GraphRAGs

The GraphRAG project aims to extend the range of questions that AI systems can answer on private datasets by exploiting implicit relationships in unstructured text. A key advantage of GraphRAG over traditional vector RAG (or "semantic search") is its ability to answer global queries over entire datasets, such as...

2025-03-17knowledge map Knowledge Retrieval and the RAG Framework

DeepSearch/DeepResearch中最优文本段选择和URL重排-首席AI分享圈

Optimal Text Segment Selection and URL Rearrangement in DeepSearch/DeepResearch

If you have read Jina's last classic article "Design and Implementation of DeepSearch/DeepResearch", then you may want to dig deeper into some details that can significantly improve the quality of answers. This time, we will focus on two details: extracting optimal text segments from long web pages: how to utilize late-chun...

2025-03-13

Gemma 3 Technical Report Chinese version

Gemma 3 Key Information Summary I. Key Metrics Parameters Details Model size 100 million to 27 billion parameters in four versions: 1B, 4B, 12B, 27B Architecture Transformer-based decoder-specific architecture inherited from Gemma 2 with several improvements Multimodal capabilities Support for text and image...

2025-03-13

IDProtector: a way to protect portraits from the abuse of AI-generated technology

1. Background and Issues With the rapid development of Artificial Intelligence (AI) technologies, especially the advancement of diffusion modeling, AI has been able to generate very realistic portrait images. For example, technologies like InstantID require only one photo to generate multiple new images with the same identity features. This kind of technology though...

2025-03-11

1
2
3
4
...
next page
Total 11 pages

AI knowledge

Application Evaluation of the Use of Inference Models in Modular RAG Systems

Evaluating creativity in large language models: beyond the multiple-choice LoTbench paradigm

Getting to grips with Claude Code: a practical guide to boosting AI programming productivity

Trae Chinese Version First Invitation to Download: Unlimited use of DeepSeek-R1 after registration!

GPT-4.1 Official Tips Engineering Guide (Chinese version)

The GTR framework: a new approach to cross-table Q&A based on heterogeneous graphs and hierarchical retrieval

How EQ-Bench Assesses Emotional Intelligence and Creativity in Large Language Models

Reasoning with Large Language Models: Balancing "Underthinking" and "Overthinking"

Breaking the Tool Calling Bottleneck: The CoTools Framework Enables Large Language Models to Efficiently Utilize a Massive Number of Tools

uv common commands

Why are multi-intelligence collaborative systems more prone to error?

Anthropic Deep Dive Claude: Revealing Decision Making and Reasoning Processes in Large Language Models

Making AI Stop and Think: How Anthropic's "Think" Tool Enhances Claude Reasoning

DeepRetrieval: efficient information retrieval query generation driven by reinforcement learning

OpenAI Releases: How Large Language Models Monitor Their Own Misbehavior

[Reprint] QwQ-32B's Tool Calling Capability and Agentic RAG Application

LazyGraphRAG: Dramatically Optimizing the Quality and Cost of GraphRAGs

Optimal Text Segment Selection and URL Rearrangement in DeepSearch/DeepResearch

Gemma 3 Technical Report Chinese version

IDProtector: a way to protect portraits from the abuse of AI-generated technology

Can't find AI tools? Try here!

FLUX.1 image generator (supports Chinese input)

Recent AI Hotspots

AI Tools Recommendations

AI Tools Classification