Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: exa-labs/simple-evals
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: main
Choose a base ref
...
head repository: openai/simple-evals
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: main
Choose a head ref
Checking mergeability… Don’t worry, you can still create the pull request.
  • 12 commits
  • 13 files changed
  • 2 contributors

Commits on Jan 29, 2025

  1. Improve multilingual regexes

    etr2460 committed Jan 29, 2025
    Configuration menu
    Copy the full SHA
    18eba9d View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    f0d5b60 View commit details
    Browse the repository at this point in the history

Commits on Jan 31, 2025

  1. Merge pull request openai#41 from openai/erik/multilingual-answer-reg…

    …ex-improvements
    
    Improve multilingual regexes
    etr2460 authored Jan 31, 2025
    Configuration menu
    Copy the full SHA
    847ec46 View commit details
    Browse the repository at this point in the history
  2. Merge pull request openai#42 from openai/erik/reasoning-effort

    Add reasoning_effort param to O1ChatCompletionSampler
    etr2460 authored Jan 31, 2025
    Configuration menu
    Copy the full SHA
    73600e5 View commit details
    Browse the repository at this point in the history
  3. Changes for o3-mini

    shanth-openai committed Jan 31, 2025
    Configuration menu
    Copy the full SHA
    5ca150b View commit details
    Browse the repository at this point in the history
  4. More changes

    shanth-openai committed Jan 31, 2025
    Configuration menu
    Copy the full SHA
    1e88bdf View commit details
    Browse the repository at this point in the history
  5. Minor

    shanth-openai committed Jan 31, 2025
    Configuration menu
    Copy the full SHA
    77aedc1 View commit details
    Browse the repository at this point in the history
  6. Merge pull request openai#43 from shanth-openai/dev/shanth/o3mini

    Changes for o3-mini
    etr2460 authored Jan 31, 2025
    Configuration menu
    Copy the full SHA
    9768118 View commit details
    Browse the repository at this point in the history

Commits on Feb 1, 2025

  1. Configuration menu
    Copy the full SHA
    9c45dc1 View commit details
    Browse the repository at this point in the history
  2. Merge pull request openai#44 from shanth-openai/dev/shanth/o3minihe

    o3-mini: Add humaneval numbers, reenable it and fix bug
    etr2460 authored Feb 1, 2025
    Configuration menu
    Copy the full SHA
    83ed764 View commit details
    Browse the repository at this point in the history

Commits on Feb 8, 2025

  1. Configuration menu
    Copy the full SHA
    7a1e9e6 View commit details
    Browse the repository at this point in the history
  2. Merge pull request openai#46 from openai/erik/mmmlu-o3-mini

    Add o3-mini MMMLU benchmark results
    etr2460 authored Feb 8, 2025
    Configuration menu
    Copy the full SHA
    6e84f4e View commit details
    Browse the repository at this point in the history
Loading