Skip to content
View francedot's full-sized avatar

Block or report francedot

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

229 14 Updated Jan 6, 2025

Make websites accessible for AI agents

Python 10,694 829 Updated Jan 5, 2025

A minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel AI SDK! Search with models like Grok 2.0.

TypeScript 2,073 232 Updated Jan 6, 2025

APIM Load Balancer for AOAI

Bicep 4 Updated May 31, 2024

llamaindex node parsing for images

Jupyter Notebook 2 Updated Nov 18, 2024
Python 10 9 Updated Jan 3, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,590 4,519 Updated Aug 16, 2024

Inference and training library for high-quality TTS models.

Python 4,867 503 Updated Dec 10, 2024

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,428 426 Updated Jan 5, 2025

An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI

Python 2,340 275 Updated Jan 6, 2025

Agent S: an open agentic framework that uses computers like a human

Python 734 99 Updated Jan 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,239 5,056 Updated Jan 6, 2025

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

Python 548 52 Updated Nov 20, 2024
JavaScript 2,765 994 Updated Jun 21, 2024

🚀 Automatically deploy your project to GitHub Pages using GitHub Actions. This action can be configured to push your production-ready code into any branch you'd like.

TypeScript 4,327 364 Updated Jan 6, 2025

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 1,502 167 Updated Dec 20, 2024

OpenUI let's you describe UI using your imagination, then see it rendered live.

TypeScript 19,634 1,828 Updated Oct 21, 2024

Awesome list of 300+ agentic AI resources

Python 424 44 Updated Aug 5, 2024

A UI-Focused Agent for Windows OS Interaction.

Python 8,107 1,076 Updated Jan 6, 2025

A personal wearable AI that runs locally

Python 536 53 Updated Mar 17, 2024

signs wda

8 6 Updated Jan 9, 2022

This is an operating system independent implementation of iOS device features. You can run UI tests, launch or kill apps, install apps etc. with it.

Go 1,013 189 Updated Jan 6, 2025
Python 38 3 Updated Feb 18, 2024

InterfaceAgent: a versatile framework designed to create system and interface agents capable of managing mobile and desktop applications and features.

TypeScript 109 3 Updated May 1, 2024

Next.js Chrome Extension Starter example application that demonstrates how to build a Chrome extension using Next.js. It provides a foundation for developing Chrome extensions with Next.js, React a…

HTML 166 51 Updated Nov 30, 2024

Small "Pin To TaskBar" exe for Command Line, tested on Windows 10 Version 20H2 (Win10 19042.964). Reverse engineering of syspin.exe "PE injection into Progman" method.

C 73 15 Updated Jan 24, 2023

Embed Vite + Vue in Express

JavaScript 5 1 Updated Mar 9, 2023

A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.

Python 6,505 4,365 Updated Jan 6, 2025
Next