Comprehensive introduction YOLOv12 is an open source project developed by GitHub user sunsmarterjie , focusing on real-time target detection technology . The project is based on YOLO (You Only Look Once) series of frameworks , the introduction of the attention mechanism to optimize the performance of traditional convolutional neural networks (CNN) , not only in the detection of ...
General Introduction AutoAgent is an open source AI intelligences framework developed by the Hong Kong University Data Intelligence Laboratory (HKUDS) and hosted on GitHub.It allows users to rapidly create and deploy customized AI intelligences by describing their requirements in pure natural language, without any programming foundation. The framework supports a wide range of large language...
Enable Builder Smart Programming Mode, unlimited use of DeepSeek-R1 and DeepSeek-V3, smoother experience than the overseas version. Just enter the Chinese commands, even a novice programmer can write his own apps with zero threshold.
Comprehensive Introduction Crawl4LLM is an open source project jointly developed by Tsinghua University and Carnegie Mellon University, focusing on optimizing the efficiency of web crawling for pre-training of large models (LLM). It significantly reduces ineffective crawling by intelligently selecting high-quality web page data, claiming to be able to originally need to crawl 100 web pages of work...
General Introduction Deepdive Llama3 From Scratch is an open source project hosted on GitHub that focuses on a step-by-step parsing and implementation of the inference process for Llama3 models. It is optimized based on the naklecha/lllama3-from-scratch project, and is designed to help developers and learners deep...
General Introduction Open-Reasoner-Zero is an open source project focused on reinforcement learning (RL) research, developed by the Open-Reasoner-Zero team on GitHub. It aims to accelerate the research process in the field of artificial intelligence by providing an efficient, scalable and easy-to-use training framework, especially to the pass...
General Introduction Arc Institute Evo 2 is an open source project focused on genome modeling and design, developed by Arc Institute, a non-profit research organization based in Palo Alto, California, and launched in collaboration with partners such as NVIDIA. The project builds, through cutting-edge deep learning techniques,...
Comprehensive Introduction VLM-R1 is an open source visual language modeling project developed by Om AI Lab and hosted on GitHub. The project is based on DeepSeek's R1 approach, combined with the Qwen2.5-VL model, which significantly improves the model's visual... through reinforcement learning (R1) and supervised fine-tuning (SFT) techniques.
Comprehensive Introduction Deep Research Web UI is an open source research assistant tool based on AI technology designed to help users conduct deep iterative research on any topic. It combines the power of search engines, web crawling and large-scale language modeling to provide an efficient research experience through an intuitive web interface. Users ...
General Introduction LiteAvatar is an open source tool developed by the HumanAIGC team (under Ali) that focuses on generating facial animations from audio-driven 2D avatars in real-time. It runs at 30 frames per second (fps) relying only on the CPU, and is especially suited for scenarios that require low power consumption, such as real-time 2D...
General Introduction Botgroup.chat is an open source AI group chat application developed based on React and Cloudflare Pages, aiming to provide users with an interactive experience similar to WeChat group chat. It supports multiple AI characters to participate in conversations at the same time, and users can interact with multiple intelligent bots through a simple configuration...
Comprehensive Introduction Open Deep Research is a web-based research assistant capable of generating comprehensive research reports on any topic. The system utilizes a plan-and-do workflow that allows users to plan and review the report structure before moving on to the time-consuming research phase. Users can choose from different...
Comprehensive Introduction KGGen is an open source tool developed by the Stanford Trusted Artificial Intelligence Research Lab (STAIR Lab), hosted on GitHub, designed to automatically generate knowledge graphs from arbitrary text. It uses advanced language models and clustering algorithms to transform unstructured text data into structured real...
General Introduction MultiPost-Extension is a powerful browser extension designed to help users publish content to multiple social media platforms in one click. The extension supports synchronized posting to more than 10 mainstream platforms, including Zhihu, Weibo, Xiaohongshu, TikTok and more. Users don't need to log in, register or mention...
General Introduction Markdownify MCP Server is an open source tool based on the Model Context Protocol, hosted on GitHub and created by developer Zach Caceres. It specializes in combining multiple file types (e.g., PDF, images, audio, office documents, etc.) with...
General Introduction SkyReels-V1 is an open source project developed by the SkyworkAI team focused on generating high-quality, human-centered video content. The project is based on the HunyuanVideo model, and by fine-tuning tens of millions of high-quality movie and TV clips, it creates the world's first human action video base...
Comprehensive Introduction WeChatAI is a Python-based WeChat group chat and personal intelligent assistant, which supports a variety of large language models (such as DeepSeek, Gemini, Tongyi Thousand Questions), and can realize intelligent conversations, auto-replies and other functions. The project adopts modern interface design, simple and intuitive operation, suitable for...
Comprehensive Introduction dsRAG is a high-performance retrieval engine designed to handle complex queries on unstructured data. It performs particularly well in handling challenging queries in dense text such as financial reports, legal documents, and academic papers. dsRAG employs three key approaches to improve performance: semantic segmentation,...
Comprehensive Introduction SongGen is an open source single-stage autoregressive Transformer model designed for text-to-song generation tasks. The model is capable of generating songs containing vocals and accompaniment from text input.SongGen provides fine-grained control over a wide range of musical attributes, including lyrics, instrument descriptions,...
General Introduction Graphiti is a tool developed by getzep for building and querying dynamic, time-aware knowledge graphs. It is capable of representing complex and evolving relationships between entities and querying them through a variety of methods such as temporal, full-text, semantic, and graph algorithms.Graphiti can simultaneously handle non...