Stars
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Video Generation Foundation Models: https://saiyan-world.github.io/goku/
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.
A rule-based tunnel for Android.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
The official Python SDK for Model Context Protocol servers and clients
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
A browser extension that helps users publish content to multiple social media platforms with one click.
一款部署于云端或本地的隧道代理池中间件,可将静态代理IP灵活运用成隧道IP,提供固定请求地址,一次部署终身使用
A privacy-first, open-source platform for knowledge management and collaboration. Download link: http://github.com/logseq/logseq/releases. roadmap: http://trello.com/b/8txSM12G/roadmap
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
WiFi密码暴力破解工具-图形界面,支持WPA/WPA2/WPA3、多开并发、自动破解、自定义密码本、自动生成密码字典
Learning Convolutional Neural Networks with Interactive Visualization.
ChatGPT + DALL-E + WhatsApp = AI Assistant 🚀 🤖
⭐️⭐️⭐️微服务商城系统 springcloud微服务商城 小程序商城
开源微信爬虫:爬取公众号所有 文章、阅读量、点赞量和评论内容。易部署。持续维护!!!
Image viewer component for vue, supports rotation, scale, zoom and so on, based on viewer.js
支持word(.docx)、excel(.xlsx,.xls)、pdf、pptx等各类型office文件预览的vue组件集合,提供一站式office文件预览方案,支持vue2和3,也支持React等非Vue框架。Web-based pdf, excel, word, pptx preview library
hiprint for Vue2/Vue3 ⚡打印、打印设计、可视化设计器、报表设计、元素编辑、可视化打印编辑
A ready-to-go translation ocr tool developed with WPF/WPF 开发的一款即用即走的翻译、OCR工具
PowerPoint-ist(/'pauəpɔintist/), An online presentation application that replicates most of the commonly used features of MS PowerPoint, allowing for the editing and presentation of PPT online. Sup…