Cantai API

Integrate professional AI singing into your applications

Transform Your Applications

🎮

Singing Games

Create karaoke games, rhythm games, or music education apps with real-time vocal synthesis

🎵

Music Production

Build music streaming apps, vocal synthesis services, or voice customization platforms

🎓

Education

Build interactive music theory tools, ear training apps, or vocal harmony visualizers

🎙️

Content Creation

Generate singing vocals for podcasts, videos, or interactive media projects

Universal Input Support

Work with your preferred music formats

Standard Notation

  • MIDI Files
  • ABC Notation Files
  • Lilypond Files
  • MusicXML Files

Developer Friendly

  • JSON Score Format
  • YAML Configuration
  • CANTAI Format Files
  • Plain Text Lyrics

Advanced Features

  • Batch Processing
  • Voice Cloning
  • Style Transfer

Voice Output Formats

  • WAV Audio Files
  • OPUS Format
  • OGG Format
  • MP3 Format

Quick Start

song.yaml
song:
  notes: ["C4", "E4", "G4"]
  lyrics: ["Let's", "play", "now"]
  tempo: 120
fetch.js
const response = await fetch('https://cantai.app/v1/synthesize', {
  method: 'POST',
  headers: {
    'Authorization': 'Bearer YOUR_API_KEY'
  },
  body: JSON.stringify(songData)
});
curl
curl -X POST https://cantai.app/v1/synthesize \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d @song.yaml

High Performance

Process multiple voices simultaneously with our multi-threaded rendering engine for real-time synthesis and batch processing

🎤

Voice Types

  • High Soprano (Operatic)
  • Mezzo-Soprano
  • Tenor & Bass
  • Classical, Opera & Gospel Choir
  • Children's Choir
  • Custom Models (On Request)
☁️

Cloud Storage

Automatic file handling with secure cloud storage integration and version control for all your voice synthesis projects

Core Endpoints

POST /v1/synthesize

Transform any musical score into studio-quality AI vocals

Parameters:

  • score - Musical score file (CANTAI, MusicXML, or MIDI)
  • voice - Voice model identifier
  • format - Output format (wav, ogg, opus)
  • api_key - Your API key
POST /v1/ensemble

Create rich, multi-voice arrangements with professional studio acoustics

Parameters:

  • score - Source musical score
  • ensemble_size - Number of voices (1-16)
  • acoustics - Acoustic environment settings
  • blend - Voice blending parameters
POST /v1/score/analyze

Advanced score analysis with automatic phoneme mapping and voice optimization

Features:

  • Intelligent phoneme detection
  • Multi-language support
  • Automatic voice part assignment
  • Performance optimization suggestions

Simple, Usage-Based Pricing

Developer

$20/100 songs
  • Pay As You Go
  • No Monthly Minimum
  • Basic Support
Start Free

Have Questions?

Join our Discord community to learn more about the API

Join #api-info on Discord

⚠️ API details and supported formats may change without notice before our March 1 launch. Check back for updates.

Subscribe to Our Newsletter

* indicates required