Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

yapwithai
/
kyutai-stt-1b-en_fr

Automatic Speech Recognition
Moshi
Safetensors
English
French
stt
audio
Model card Files Files and versions
xet
Community

Instructions to use yapwithai/kyutai-stt-1b-en_fr with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • Moshi

    How to use yapwithai/kyutai-stt-1b-en_fr with Moshi:

    # pip install moshi
    # Run the interactive web server
    python -m moshi.server --hf-repo "yapwithai/kyutai-stt-1b-en_fr"
    # Then open https://localhost:8998 in your browser
    # pip install moshi
    import torch
    from moshi.models import loaders
    
    # Load checkpoint info from HuggingFace
    checkpoint = loaders.CheckpointInfo.from_hf_repo("yapwithai/kyutai-stt-1b-en_fr")
    
    # Load the Mimi audio codec
    mimi = checkpoint.get_mimi(device="cuda")
    mimi.set_num_codebooks(8)
    
    # Encode audio (24kHz, mono)
    wav = torch.randn(1, 1, 24000 * 10)  # [batch, channels, samples]
    with torch.no_grad():
        codes = mimi.encode(wav.cuda())
        decoded = mimi.decode(codes)
  • Notebooks
  • Google Colab
  • Kaggle
kyutai-stt-1b-en_fr
2.36 GB
Ctrl+K
Ctrl+K
  • 2 contributors
History: 5 commits
lmz's picture
lmz
eustlb's picture
eustlb HF Staff
transformers support (#5)
40b0340 verified 11 months ago
  • .gitattributes
    1.52 kB
    initial commit 12 months ago
  • README.md
    3.79 kB
    transformers support (#5) 11 months ago
  • config.json
    1.26 kB
    Update config.json 12 months ago
  • mimi-pytorch-e351c8d8@125.safetensors
    385 MB
    xet
    Upload folder using huggingface_hub 12 months ago
  • model.safetensors
    1.98 GB
    xet
    Upload folder using huggingface_hub 12 months ago
  • tokenizer_en_fr_audio_8000.model
    120 kB
    xet
    Upload folder using huggingface_hub 12 months ago