QuentinFuxa / WhisperLiveKit
Real-time & local speech-to-text, translation, and speaker diarization. With server & web UI.
See what the GitHub community is most excited about today.
Real-time & local speech-to-text, translation, and speaker diarization. With server & web UI.
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and Video Understanding on Your Phone
All Algorithms implemented in Python
Generate audiobooks from e-books
Multi-Platform Live Stream Automatic Recording Tool | 多平台直播流自动录制客户端 · 基于FFmpeg · 支持监控/定时/转�?�
E-mails, subdomains and names Harvester - OSINT
verl: Volcano Engine Reinforcement Learning for LLMs
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
Open Source Alternative to NotebookLM / Perplexity, connected to external sources such as Search Engines, Slack, Linear, Jira, ClickUp, Confluence, Notion, YouTube, GitHub, Discord and more. Join our discord: https://discord.gg/ejRNvftDp9
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.
Build Real-Time Knowledge Graphs for AI Agents
Android in docker solution with noVNC supported and video recording
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer