Examples | Model Card | Playground Waitlist
Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. The model can also produce nonverbal communications like laughing, sighing and crying. To support the research community, we are providing access to pretrained model checkpoints ready for inference.
This repo is a fastAPI Endpoint for Bark. Link to orignal repo
- Clone the repo
- run docker cmd
docker build -t <name_of_image> .
- after the build is complete run
docker run -p 9000:9000 bark_api
to start the container - The Endpoint is exposed at port
9000
by default (use postman of equivalent for inference.) - The inference will return a Audio file (FileResponse class)