lengyue233/content-vec-best

Name: lengyue233/content-vec-best
Rating: 5 (22 reviews)
Author: lengyue233

transformerstransformerspytorchhubertdoi:10.57967/hf/0479license:mitendpoints_compatiblemit

22

HuggingFace

913.6K

Content Vec Best

Official Repo: ContentVec
This repo brings fairseq ContentVec model to HuggingFace Transformers.

How to use

To use this model, you need to define

from transformers import HubertModel
import torch.nn as nn
class HubertModelWithFinalProj(HubertModel):
    def __init__(self, config):
        super().__init__(config)

        # The final projection layer is only used for backward compatibility.
        # Following https://github.com/auspicious3000/contentvec/issues/6
        # Remove this layer is necessary to achieve the desired outcome.
        self.final_proj = nn.Linear(config.hidden_size, config.classifier_proj_size)

and then load the model with

audio = torch.randn(1, 16000)

model = HubertModelWithFinalProj.from_pretrained("lengyue233/content-vec-best")

x = model(audio)["last_hidden_state"]

How to convert

You need to download the ContentVec_legacy model from the official repo, and then run

python convert.py

Deploy Model on Runcrate

Run this model on powerful GPU infrastructure. Deploy in 60 seconds.

Pay per second

H100, A100, RTX GPUs

Instant deployment

DEPLOY IN 60 SECONDS

Run content-vec-best on Runcrate

Deploy on H100, A100, or RTX GPUs. Pay only for what you use. No setup required.