AI Personal Learning
and practical guidance

Tags: ai open source projects Page 47

AigoTools: an open source, multilingual AI tool navigation site with automatic site inclusion

Comprehensive Introduction: AigoTools is an open source AI site navigation project designed to help users quickly create and manage navigation sites. It has built-in site management and AI-based automatic site inclusion, and supports multiple languages, dark/light theme switching, and SEO optimization. AigoTools provides a variety of image storage solutions, including this ...

Amphion MaskGCT: zero-shot text-to-speech voice cloning model (local one-click deployment package)

Comprehensive Introduction: MaskGCT (Masked Generative Codec Transformer) is a fully non-autoregressive text-to-speech (TTS) model jointly introduced by Quwan Technology (趣丸科技) and The Chinese University of Hong Kong. The model does not require explicit text-to-speech alignment information and adopts a two-stage generation approach, which first passes ...

PDF to Podcast: a utility for converting PDFs into podcasts

General Introduction: This recipe is inspired by the podcast generation feature of NotebookLM and the recent Open NotebookLM open source implementation. It walks step by step through building a PDF-to-podcast pipeline. Given any PDF, we will generate a segment where the host and guest discuss and explain ...

MindSearch: an open source AI search engine framework for deploying your own Perplexity-style search engine

Comprehensive Introduction: MindSearch is an open source AI search engine framework launched by Shanghai Artificial Intelligence Laboratory. It aims to simulate the human thought process for complex information gathering and integration. The tool combines large language models (LLMs) and search engines with a multi-agent framework to achieve the ...

CosyVoice: Alibaba's open source project for rapid 3-second voice cloning, with support for emotion control tags

Comprehensive Introduction: CosyVoice is a multilingual large-scale speech generation model that provides full-stack capabilities from inference and training to deployment. Developed by the FunAudioLLM team, it aims to achieve high-quality speech synthesis through advanced autoregressive transformers and ODE-based diffusion models. CosyVoice not only supports ...

Fabric: an open source AI workflow framework that bundles many prompts to handle a variety of tasks efficiently

General Introduction: Fabric is an open source AI framework developed by Daniel Miessler to simplify and automate everyday computer tasks and make artificial intelligence easier to use. Through a modular design and preset prompts (Patterns), it helps users efficiently handle tasks such as content summarization and data extraction ...

TANGO: a tool for generating speech-coordinated gesture videos of full-body digital humans

General Introduction: TANGO (Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation) is an open source co-speech gesture video generation framework jointly developed by the University of Tokyo and CyberAgent AI Lab. The ...

Pyramid Flow: an open source counterpart to Kling launched by Kuaishou, based on SD3 and running on GPUs with less than 8 GB of memory (one-click deployment version)

Comprehensive Introduction: Pyramid Flow is an efficient autoregressive video generation method based on the Flow Matching technique. The method enables generation and decompression of video content with higher computational efficiency by interpolating between different resolutions and noise levels. Pyramid Flow is capable of generating high-quality ...

Dify: a generative AI application development platform with visual orchestration and support for private deployment

Comprehensive Introduction: Dify is an open source generative AI application development platform designed to help developers rapidly build and operate AI-native applications based on large language models (LLMs). The platform provides a range of functions, from Agent construction to AI workflow orchestration, RAG retrieval, and model management, supporting the development of ...

ModelBest (面壁智能): world-leading lightweight, high-performance on-device large models

General Introduction: ModelBest is a company specializing in lightweight, high-performance large models, dedicated to bringing advanced AI technologies to mainstream consumer electronics and other everyday edge devices. Its MiniCPM series of on-device models is known for extreme compute and memory efficiency, with small parameter counts, ...
