AI Personal Learning
and practical guidance

Articles by Yang Fan

HealthGPT: A Medical Large Model for Medical Image Analysis and Diagnostic Q&A

General Introduction: HealthGPT is a state-of-the-art medical large vision-language model designed to unify medical visual understanding and generation through heterogeneous knowledge adaptation. The goal of the project is to integrate both capabilities into a single autoregressive framework, significantly enhancing the medical image processing...

MatAnyone: an open-source tool that extracts a specified target portrait from a video and generates the target portrait video

General Introduction: MatAnyone is an open-source project focused on video matting, developed by a research team at S-Lab, Nanyang Technological University, Singapore, and released on GitHub. It gives users stable and efficient video processing through consistent memory propagation, and is especially good at dealing with complex backgrounds...

OmniParser: parses user interface screenshots into structured elements that large models can easily understand and act on

General Introduction: OmniParser is a tool developed by Microsoft that parses user interface screenshots into structured, easy-to-understand elements. It significantly improves the ability of GPT-4V to generate accurate actions in the corresponding interface area. OmniParser not only supports a wide range of large language models, but also...

Trae combined with Obsidian becomes a powerful writing tool: upgrading a local knowledge base into an AI writing assistant

This is a reprinted article. The earlier post, "Using intelligent programming tools Trae to create an all-powerful writing platform", promised that the next installment would cover how to use Trae to empower a local knowledge base. After being held up for two days by a server crash, I happened to read this article and am passing it along here as a companion piece to the original, included in the...

Microsoft AI Agent Introductory Course: Multi-Agent Design Patterns

When you start working on a project that involves multiple agents, you need to consider the multi-agent design pattern. However, it may not be obvious when to move to multiple agents and what the advantages are. Introduction: In this course, Microsoft attempts to answer the following questions: What scenarios are suitable for multiple agents?

Microsoft AI Agent Introductory Course: Planning and Design

Introduction: This article covers the following: defining clear overarching goals and breaking complex tasks into manageable subtasks; leveraging structured output for more reliable, machine-readable responses; and applying an event-driven approach to handle dynamic tasks and unexpected inputs. Learning Objectives: After completing this article...

Microsoft AI Agent Introductory Course: Agentic RAG

This course provides a comprehensive overview of Agentic Retrieval-Augmented Generation (Agentic RAG), an emerging AI paradigm in which Large Language Models (LLMs) autonomously plan their next actions while acquiring information from external sources. Instead of the static "retrieve-then-read" model...
