Skip to content
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
43 commits
Select commit Hold shift + click to select a range
d654cb9
fix(cohere): Implement stateless decoder to fix cache repetition bug
Alex-Wengg Apr 6, 2026
a5d1fc0
docs(cohere): Update README with stateless decoder status and complet…
Alex-Wengg Apr 6, 2026
079ec18
chore(cohere): Remove broken/experimental export scripts
Alex-Wengg Apr 6, 2026
9fe6808
chore(cohere): Delete archive and mark broken HF uploads
Alex-Wengg Apr 6, 2026
2e96dce
refactor(cohere): Clean up test suite and add PyTorch baseline
Alex-Wengg Apr 6, 2026
2994397
chore(cohere): Delete redundant utility scripts
Alex-Wengg Apr 6, 2026
9aad907
feat(cohere): implement stateful decoder with Qwen3 approach
Alex-Wengg Apr 6, 2026
068c718
test(cohere): add comprehensive benchmarks for stateful decoder
Alex-Wengg Apr 6, 2026
3d096ef
chore: remove outdated debug scripts, logs, and reference code
Alex-Wengg Apr 6, 2026
1ae8422
feat(cohere): add 256-token decoder and investigation scripts
Alex-Wengg Apr 6, 2026
98ea02b
docs(cohere): Identify encoder as root cause of quality issues
Alex-Wengg Apr 6, 2026
947058a
docs(cohere): Complete root cause analysis - encoder training data bias
Alex-Wengg Apr 6, 2026
3329c99
fix(cohere): Correct audio window to 35 seconds (3500 frames)
Alex-Wengg Apr 6, 2026
c7a4db8
docs(cohere): Document .mlpackage requirement and .mlmodelc limitations
Alex-Wengg Apr 6, 2026
8f0bf24
docs(cohere): Update README with current status and .mlpackage requir…
Alex-Wengg Apr 6, 2026
bb89d2d
chore(cohere): Clean up obsolete files and failed experiments
Alex-Wengg Apr 6, 2026
b008c99
chore(cohere): Remove remaining obsolete files
Alex-Wengg Apr 6, 2026
6eba9b1
chore(cohere): Remove temporary upload docs and obsolete tests
Alex-Wengg Apr 6, 2026
ca4aab1
chore(cohere): Remove obsolete build artifacts and test files
Alex-Wengg Apr 6, 2026
cfb3ecb
refactor(cohere): Organize original PyTorch files into cohere-pytorch…
Alex-Wengg Apr 6, 2026
13f9535
docs(cohere): Add historical context and verified performance results
Alex-Wengg Apr 6, 2026
36835ed
feat(cohere): Add INT8 quantized models and benchmarks
Alex-Wengg Apr 6, 2026
4cbd37d
refactor(cohere): Reorganize scripts and create unified benchmark tool
Alex-Wengg Apr 6, 2026
fcc47a2
refactor(cohere): Use jiwer library for text normalization
Alex-Wengg Apr 6, 2026
0790b6c
fix(cohere): Use google/fleurs dataset with correct field names
Alex-Wengg Apr 6, 2026
fc3c20b
refactor(cohere): Organize test files and scripts
Alex-Wengg Apr 6, 2026
c7e0b11
docs(cohere): Add comprehensive research analysis and limitations
Alex-Wengg Apr 6, 2026
e56f48d
feat(cohere): Add stateless decoder variant (Parakeet approach)
Alex-Wengg Apr 6, 2026
7c088a3
docs(cohere): Add FP16 vs INT8 FLEURS comparison analysis
Alex-Wengg Apr 7, 2026
e9f9973
feat(cohere): Add INT4 quantization experiments and comprehensive res…
Alex-Wengg Apr 7, 2026
887b22b
fix(cohere): Address critical Devin review issues
Alex-Wengg Apr 7, 2026
395e48a
fix(cohere): Fix test file issues from Devin review
Alex-Wengg Apr 7, 2026
f81dfb7
fix(cohere): Fix stateful decoder export issues from Devin review
Alex-Wengg Apr 7, 2026
8c95861
fix(cohere): Commit uv.lock for reproducibility
Alex-Wengg Apr 7, 2026
1edbc01
chore(cohere): Add test results and cache to gitignore
Alex-Wengg Apr 7, 2026
306a283
refactor(cohere): Centralize test scripts into tests/ directory
Alex-Wengg Apr 7, 2026
6209f8a
refactor(cohere): Move benchmark scripts to tests/ directory
Alex-Wengg Apr 7, 2026
5d12a80
Fix EOS token detection in cache-external decoder
Alex-Wengg Apr 8, 2026
e007570
Verify .mlmodelc compilation for Swift integration
Alex-Wengg Apr 8, 2026
049382a
docs: Add sync status for mobius ↔ FluidAudio updates
Alex-Wengg Apr 8, 2026
073d7a2
docs: Document Swift benchmark attempt and model compatibility issues
Alex-Wengg Apr 8, 2026
34a6bfb
prep: HuggingFace upload package for cache-external decoder
Alex-Wengg Apr 8, 2026
e9286d1
research: Comprehensive investigation of Cohere multilingual ASR failure
Alex-Wengg Apr 9, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
12 changes: 11 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -9,4 +9,14 @@ __pycache__
*.mlmodelc

# Large numpy arrays (exported constants - regenerate via export_constants.py)
*.npy
*.npy

# PyTorch model weights (download from HuggingFace)
*.safetensors
*.bin
*.pt
*.pth
*.ckpt

# ONNX models
*.onnx
Original file line number Diff line number Diff line change
@@ -0,0 +1,99 @@
- dataset:
id: hf-audio/open-asr-leaderboard
task_id: mean_wer
value: 5.42
date: '2026-03-24'
source:
url: https://huggingface.co/hf-audio
name: open-asr-leaderboard
user: hf-audio

- dataset:
id: hf-audio/open-asr-leaderboard
task_id: rtfx
value: 524.88
date: '2026-03-24'
source:
url: https://huggingface.co/hf-audio
name: open-asr-leaderboard
user: hf-audio

- dataset:
id: hf-audio/open-asr-leaderboard
task_id: ami_wer
value: 8.13
date: '2026-03-24'
source:
url: https://huggingface.co/hf-audio
name: open-asr-leaderboard
user: hf-audio

- dataset:
id: hf-audio/open-asr-leaderboard
task_id: earnings22_wer
value: 10.86
date: '2026-03-24'
source:
url: https://huggingface.co/hf-audio
name: open-asr-leaderboard
user: hf-audio

- dataset:
id: hf-audio/open-asr-leaderboard
task_id: gigaspeech_wer
value: 9.34
date: '2026-03-24'
source:
url: https://huggingface.co/hf-audio
name: open-asr-leaderboard
user: hf-audio

- dataset:
id: hf-audio/open-asr-leaderboard
task_id: librispeech_clean_wer
value: 1.25
date: '2026-03-24'
source:
url: https://huggingface.co/hf-audio
name: open-asr-leaderboard
user: hf-audio

- dataset:
id: hf-audio/open-asr-leaderboard
task_id: librispeech_other_wer
value: 2.37
date: '2026-03-24'
source:
url: https://huggingface.co/hf-audio
name: open-asr-leaderboard
user: hf-audio

- dataset:
id: hf-audio/open-asr-leaderboard
task_id: spgispeech_wer
value: 3.08
date: '2026-03-24'
source:
url: https://huggingface.co/hf-audio
name: open-asr-leaderboard
user: hf-audio

- dataset:
id: hf-audio/open-asr-leaderboard
task_id: tedlium_wer
value: 2.49
date: '2026-03-24'
source:
url: https://huggingface.co/hf-audio
name: open-asr-leaderboard
user: hf-audio

- dataset:
id: hf-audio/open-asr-leaderboard
task_id: voxpopuli_wer
value: 5.87
date: '2026-03-24'
source:
url: https://huggingface.co/hf-audio
name: open-asr-leaderboard
user: hf-audio
37 changes: 37 additions & 0 deletions models/stt/cohere-transcribe-03-2026/cohere-pytorch/.gitattributes
Original file line number Diff line number Diff line change
@@ -0,0 +1,37 @@
*.7z filter=lfs diff=lfs merge=lfs -text
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

no lfs pls. do not commit here

*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
*.wav filter=lfs diff=lfs merge=lfs -text
assets/*.png filter=lfs diff=lfs merge=lfs -text
Loading