Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
jianfch authored Mar 17, 2023
1 parent 75611f7 commit 0f2f699
Showing 1 changed file with 2 additions and 6 deletions.
8 changes: 2 additions & 6 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,19 +4,15 @@ This script modifies [OpenAI's Whisper](https://github.com/openai/whisper) to pr

![jfk](https://user-images.githubusercontent.com/28970749/225825244-f3df9607-91ab-4011-a333-7e3ae94da08f.PNG)


https://user-images.githubusercontent.com/28970749/225825286-cdb14d70-566f-454b-a2b3-b61b4b3e09c9.mp4


### What's new in 2.0.0 ?
- updated to use Whisper's more reliable word-level timestamps method.
- the more reliable word timestamps allows regrouping segments word by word.
- can now suppress silence with [Silero VAD](https://github.com/snakers4/silero-vad) (requires PyTorch 1.2.0+)
- non-VAD silencing suppress is also more robust


https://user-images.githubusercontent.com/28970749/225825316-41060f7f-0676-4d7c-972a-cd318b8a79c2.mp4

https://user-images.githubusercontent.com/28970749/225826345-ef7115db-51e4-4b23-aedd-069389b8ae43.mp4

### Features
- more control over the timestamps than default Whisper
Expand Down Expand Up @@ -67,7 +63,7 @@ result.save_as_json('audio.json')

### Regrouping Words
Stable-ts has a preset for regrouping word into different segments. This preset is enabled by `regroup=True`.
But are other built-in regrouping methods that allow you to customize the regrouping logic.
But there are other built-in regrouping methods that allow you to customize the regrouping logic.
This preset is just a predefined a combination of those methods.


Expand Down

0 comments on commit 0f2f699

Please sign in to comment.