AI Personal Learning
and practical guidance
Beanbag Marscode1

AI knowledge

DeepRetrieval: reinforcement learning-driven efficient information retrieval query generation-Chief AI Sharing Circle

DeepRetrieval: efficient information retrieval query generation driven by reinforcement learning

Abstract Information retrieval systems are critical for efficient access to large document collections. Recent approaches utilize Large Language Models (LLMs) to improve retrieval performance through query augmentation, but typically rely on expensive supervised learning or distillation techniques that require significant computational resources and manually labeled data. In ...

OpenAI Releases: How Large Language Models Monitor Their Own Misbehavior - Chief AI Sharing Circle

OpenAI Releases: How Large Language Models Monitor Their Own Misbehavior

Large reasoning models exploit vulnerabilities when given the opportunity. Research has shown that these exploits can be detected by using large language models (LLMs) to monitor their chains-of-thought (CoT). Punishing models for "bad thoughts" does not prevent most misbehavior, but rather allows them to hide their intentions. ...

Optimal Text Segment Selection and URL Rearrangement in DeepSearch/DeepResearch - Chief AI Sharing Circle

Optimal Text Segment Selection and URL Rearrangement in DeepSearch/DeepResearch

If you have read Jina's last classic article "Design and Implementation of DeepSearch/DeepResearch", then you may want to dig deeper into some details that can significantly improve the quality of answers. This time, we will focus on two details: extracting optimal text segments from long web pages: how to utilize late-chun...

Gemma 3 Technical Report in Chinese - Chief AI Sharing Circle

Gemma 3 Technical Report Chinese version

Gemma 3 Key Information Summary I. Key Metrics Parameters Details Model size 100 million to 27 billion parameters in four versions: 1B, 4B, 12B, 27B Architecture Transformer-based decoder-specific architecture inherited from Gemma 2 with several improvements Multimodal capabilities Support for text and image...

IDProtector: ways to protect portraits from abuse of AI-generated technology - Chief AI Sharing Circle

IDProtector: a way to protect portraits from the abuse of AI-generated technology

1. Background and Issues With the rapid development of Artificial Intelligence (AI) technologies, especially the advancement of diffusion modeling, AI has been able to generate very realistic portrait images. For example, technologies like InstantID require only one photo to generate multiple new images with the same identity features. This kind of technology though...

Long Text Vector Modeling a Blind Spot Beyond 4K Tokens? -Chief AI Sharing Circle

Long Text Vector Modeling a Blind Spot Beyond 4K Tokens?

NoLiMA, released in February 2025, is a Large Language Model (LLM) method for assessing long text comprehension. Unlike traditional Needle-in-a-Haystack (NIAH) tests, which rely on keyword matching, NoLiMA is characterized by carefully designed questions and key messages that force...

LangChain vs. LangGraph: Officials Tell You What to Choose - Chief AI Sharing Circle

LangChain vs. LangGraph: The Officials Tell You What to Choose

The field of generative AI is currently evolving rapidly, with new frameworks and technologies emerging. Therefore, readers need to be aware that the content presented in this paper may be time-sensitive. In this paper, we will take an in-depth look at the two dominant frameworks for building LLM applications, LangChain and LangGraph, and analyze their strengths and weaknesses,...

Synergies and Differences between MCP Server, Function Call and Agent - Chief AI Sharing Circle

Synergies and Differences between MCP Server, Function Call and Agent

Understanding the three key concepts of MCP Server, Function Call, and Agent is essential in the burgeoning field of Artificial Intelligence (AI), especially Large Language Modeling (LLM). They are the cornerstones of an AI system, and each has a unique and interrelated role to play. A deeper understanding of it...

How GRPO outperforms o1, o3-mini and R1-Chief AI in the game of "Clue of Time" Share Circle

How GRPO outdid the o1, o3-mini and R1 in the game "Clue of Time".

In recent years, the field of Artificial Intelligence has made significant progress in its reasoning capabilities. After OpenAI demonstrated the powerful inference potential of large-scale language models (LLMs) last year, organizations such as Google DeepMind, Alibaba, DeepSeek, and Anthropic have been quick to follow suit, using reinforcement learning (RL) techniques to train...

Interpretation of Key Parameters of Large Models: Token, Context Length and Output Limits-Chief AI Sharing Circle

Interpreting the key parameters of the big model: Token, context length and output limits

Large-scale language modeling (LLM) is playing an increasingly important role in the field of artificial intelligence. In order to better understand and apply LLMs, we need to gain a deeper understanding of their core concepts. In this paper, we will focus on three key concepts, namely Token, Maximum Output Length, and Context Length, to help readers clear the understanding barriers so as to...

Agentic AI, AI Agents and Agents: concepts explained - Chief AI Sharing Circle

Agentic AI, AI Agents and Agents: a conceptual explanation

Recently, the terms Autonomous AI (AI), AI Agents, and Agents have been popping up a lot. Frankly, despite being data analysts and scientists, industry players have been a bit resistant to these AI-related trends and buzzwords in the past...

AI Coding Editor: Uncovering How Cline Works - Chief AI Sharing Circle

AI Coding Editor: Uncovering How Cline Works

In recent years, Artificial Intelligence (AI) technologies have triggered a profound change in the field of programming. From v0 and bolt.new to programming tools that integrate Agent technology such as Cursor and Windsurf, AI Coding shows great potential to play a key role in the software development process, especially in rapid proto...

What is Artifact Interactive Mode - Chief AI Sharing Circle

What is Artifact Interaction Mode

In the age of AI-assisted programming, we want AI to generate code that is not just static text, but can be parsed, edited, previewed, and even executed. This demand has given rise to a new interaction paradigm - Artifact. In this article, we will analyze Artifact from theoretical concepts to practical implementation....

en_USEnglish