Skip to content
View bstee615's full-sized avatar

Organizations

@microsoft

Block or report bstee615

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An agent benchmark with tasks in a simulated software company.

Python 82 7 Updated Dec 20, 2024

A curated list of awesome jq tools and resources.

831 42 Updated Dec 14, 2024

Get your documents ready for gen AI

Python 16,800 867 Updated Dec 19, 2024

Official repository for our paper "FullStack Bench: Evaluating LLMs as Full Stack Coders"

Python 51 3 Updated Dec 12, 2024
Python 13 1 Updated Nov 23, 2024
Jupyter Notebook 17 9 Updated Aug 14, 2024

Anthropic's Interactive Prompt Engineering Tutorial

Jupyter Notebook 1,978 209 Updated Jul 11, 2024

LLMSAN: Sanitizing Large Language Models in Bug Detection with Data-Flow

Java 1 Updated Oct 6, 2024

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Python 3,390 253 Updated Aug 10, 2024

Set of tools to assess and improve LLM security.

Python 2,807 463 Updated Dec 20, 2024

Goshawk is a static analyze tool to detect memory corruption bugs in C source codes. It utilizes NLP to infer custom memory management functions and uses data flow analysis to abstract their behavi…

C++ 80 15 Updated Dec 18, 2023

Windows inside a Docker container.

Shell 31,159 2,128 Updated Dec 21, 2024
Java 25 6 Updated Jan 27, 2024

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Jupyter Notebook 2,609 354 Updated Dec 25, 2024

OWASP Benchmark is a test suite designed to verify the speed and accuracy of software vulnerability detection tools. A fully runnable web app written in Java, it supports analysis by Static (SAST),…

Java 675 1,087 Updated Dec 16, 2024

A test suite designed to verify the speed and accuracy of software vulnerability detection tools

Java 1 Updated Jul 12, 2024

A resource leak repository

6 Updated Oct 22, 2024

🙌 OpenHands: Code Less, Make More

Python 39,285 4,427 Updated Dec 28, 2024

A manually vetted dataset for security vulnerability detection in Java projects

Python 23 3 Updated Dec 9, 2024

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

R 6,519 253 Updated Dec 10, 2024

Security vulnerability database inclusive of CVEs and GitHub originated security advisories from the world of open source software.

1,778 342 Updated Dec 28, 2024

Zero shot vulnerability discovery using LLMs

Python 1,260 129 Updated Oct 31, 2024

State-of-the-art native debugging tools

C 2,986 381 Updated Dec 25, 2024

A curated list of awesome remote jobs and resources. Inspired by https://github.com/vinta/awesome-python

35,775 3,993 Updated Dec 5, 2024

Create web-based user interfaces with Python. The nice way.

Python 10,431 624 Updated Dec 28, 2024

This repo will contain the code of a paper we are publishing.

Python 1 Updated Oct 5, 2024

Prompt design using JSX.

TypeScript 2,178 121 Updated Oct 14, 2024

[NeurIPS'24] SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning

Python 11 1 Updated Nov 19, 2024

An overview of LLMs for cybersecurity.

539 51 Updated Sep 21, 2024

Code for "An Empirical Study of Deep Learning Models for Vulnerability Detection", published in ICSE 2023.

Jupyter Notebook 8 Updated Jun 23, 2024
Next