AI Personal Learning and Practical Guidance
Total 45 articles

Tags: ai speech to text

Meeting Minutes: an open source client for local real-time transcription and meeting-minute generation

Meeting Minutes (aka Meetily) is a free and open source AI meeting assistant developed by Zackriya Solutions. It captures meeting audio in real time, transcribes it, and automatically extracts meeting summaries. The tool runs entirely on local devices and supports macOS ...

FireRedASR: An Open Source Model for Multilingual High-Precision Speech Recognition

FireRedASR is a speech recognition model developed and open-sourced by the Xiaohongshu (Little Red Book) FireRed team that provides high-precision, multilingual automatic speech recognition (ASR). The project is hosted on GitHub for developers and researchers, is designed for industrial-grade use, and supports Mandarin, Chinese...

LiberSonora: Audiobook Subtitle Extraction and Multilingual Translation, Audiobook Transcription into Multiple Languages

LiberSonora, meaning "free sound", is an AI-enabled open source audiobook toolset that supports intelligent subtitle extraction, AI title generation, and multi-language translation, with GPU-accelerated batch offline processing. LiberSonora is designed with the concept of modular...

PengChengStarling: A Multilingual Speech-to-Text Tool Smaller and Faster than Whisper-Large v3

PengChengStarling (PengCheng Labs) is a multilingual automatic speech recognition (ASR) toolkit that converts speech in different languages into the corresponding text. It is built on the icefall project and provides a complete speech recognition pipeline, including data processing, model training,...

Notta: AI meeting recording and audio transcription tool to automatically transcribe meetings, interviews or recordings

Notta is an AI meeting recording and audio transcription tool that automatically converts meetings, interviews, or audio recordings into searchable text. With Notta, users can transcribe, edit, summarize, and collaborate to boost productivity. Notta supports 58 languages for transcription...

AI no jimaku gumi: Automatic generation and translation of multilingual subtitles for videos with the help of AI

AI no jimaku gumi ("AI subtitle group") is a command-line video subtitle tool that automates subtitle extraction, transcription, and translation. It integrates the Whisper speech recognition model and a variety of translation backends (such as Dee...

FunClip: Intelligent editing of video content into short clips, with accurate clip extraction and cropping

FunClip is a fully open source automatic video editing tool that runs locally, developed by the Tongyi Speech Lab of Alibaba DAMO Academy. It integrates the industrial-grade Paraformer-Large speech recognition model, which recognizes the speech in a video and converts it to text. Special Features...

Chief AI Sharing Circle

Chief AI Sharing Circle specializes in AI learning, providing comprehensive AI learning content, AI tools and hands-on guidance. Our goal is to help users master AI technology and explore the unlimited potential of AI together through high-quality content and practical experience sharing. Whether you are an AI beginner or a senior expert, this is the ideal place for you to gain knowledge, improve your skills and realize innovation.
