Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

tomaarsen
/
reranker-distilroberta-base-quora-duplicates

Text Classification
sentence-transformers
Safetensors
English
roberta
cross-encoder
Generated from Trainer
dataset_size:404290
loss:BinaryCrossEntropyLoss
Eval Results (legacy)
text-embeddings-inference
Model card Files Files and versions
xet
Community
1

Instructions to use tomaarsen/reranker-distilroberta-base-quora-duplicates with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • sentence-transformers

    How to use tomaarsen/reranker-distilroberta-base-quora-duplicates with sentence-transformers:

    from sentence_transformers import CrossEncoder
    
    model = CrossEncoder("tomaarsen/reranker-distilroberta-base-quora-duplicates")
    
    query = "Which planet is known as the Red Planet?"
    passages = [
    	"Venus is often called Earth's twin because of its similar size and proximity.",
    	"Mars, known for its reddish appearance, is often referred to as the Red Planet.",
    	"Jupiter, the largest planet in our solar system, has a prominent red spot.",
    	"Saturn, famous for its rings, is sometimes mistaken for the Red Planet."
    ]
    
    scores = model.predict([(query, passage) for passage in passages])
    print(scores)
  • Notebooks
  • Google Colab
  • Kaggle
reranker-distilroberta-base-quora-duplicates
333 MB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 3 commits
tomaarsen's picture
tomaarsen HF Staff
Add auto-generated README
fdb2c71 verified over 1 year ago
  • .gitattributes
    1.52 kB
    initial commit over 1 year ago
  • README.md
    26.6 kB
    Add auto-generated README over 1 year ago
  • config.json
    807 Bytes
    Upload CrossEncoder over 1 year ago
  • merges.txt
    456 kB
    Upload CrossEncoder over 1 year ago
  • model.safetensors
    328 MB
    xet
    Upload CrossEncoder over 1 year ago
  • special_tokens_map.json
    295 Bytes
    Upload CrossEncoder over 1 year ago
  • tokenizer.json
    3.56 MB
    Upload CrossEncoder over 1 year ago
  • tokenizer_config.json
    1.3 kB
    Upload CrossEncoder over 1 year ago
  • vocab.json
    798 kB
    Upload CrossEncoder over 1 year ago