smart-turn-v2
Voice Activity Detection • pipecat-ai
An open source, community-driven, native audio turn detection model (second version).
| Model Info | |
|---|---|
| Batch | Yes | 
| Unit Pricing | $0.00034 per audio minute | 
Parameters
* indicates a required field
Input
- 0 object
  - audio object required: readable stream with audio data and the content type specified for that data
    - body object required
    - contentType string required
  - dtype string: type of the PCM data sent to the inference server as a raw array (one of uint8, float32, float64)
- 1 object
  - audio string required: base64-encoded audio data
  - dtype string: type of the PCM data sent to the inference server as a raw array (one of uint8, float32, float64)
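As an illustration of the second input variant, the sketch below base64-encodes a byte buffer and attaches an optional `dtype`. The helper name `build_request` is hypothetical, and the sketch makes no assumption about sample rate, byte order, or container format, since this page does not specify them.

```python
import base64
from typing import Optional

# Values the input schema allows for `dtype`.
ALLOWED_DTYPES = {"uint8", "float32", "float64"}


def build_request(audio_bytes: bytes, dtype: Optional[str] = None) -> dict:
    """Build the base64 input variant (hypothetical helper, not an official API)."""
    # `audio` is the only required field in this variant.
    payload = {"audio": base64.b64encode(audio_bytes).decode("ascii")}
    if dtype is not None:
        # Reject values outside the documented enum before sending anything.
        if dtype not in ALLOWED_DTYPES:
            raise ValueError(f"unsupported dtype: {dtype!r}")
        payload["dtype"] = dtype
    return payload
```

The resulting dictionary can be serialized with `json.dumps` and sent as the request body by whatever client the application already uses.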
Output
- is_complete boolean: if true, end-of-turn was detected
- probability number: probability of the end-of-turn detection
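A minimal sketch of how a caller might consume these two fields once the response has been decoded from JSON. `TurnResult`, `parse_turn_result`, `should_respond`, and the `min_probability` knob are all hypothetical names for illustration. The extra threshold is an application-level choice, not something this page prescribes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TurnResult:
    is_complete: bool
    probability: float


def parse_turn_result(output: dict) -> TurnResult:
    """Read the two documented output fields from a decoded JSON object."""
    # Raises KeyError if a field is missing; the schema does not mark
    # either field as required, so callers may prefer a softer fallback.
    return TurnResult(
        is_complete=bool(output["is_complete"]),
        probability=float(output["probability"]),
    )


def should_respond(result: TurnResult, min_probability: float = 0.5) -> bool:
    """Respond only when the model reports end-of-turn and the probability
    clears an optional, application-chosen threshold (an assumption)."""
    return result.is_complete and result.probability >= min_probability
```

Raising `min_probability` makes an assistant slower to interrupt, while lowering it makes replies snappier at the cost of occasionally cutting the speaker off.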
API Schemas
The following schemas are based on JSON Schema
Input
{
    "type": "object",
    "oneOf": [
        {
            "properties": {
                "audio": {
                    "type": "object",
                    "description": "readable stream with audio data and content-type specified for that data",
                    "properties": {
                        "body": {
                            "type": "object"
                        },
                        "contentType": {
                            "type": "string"
                        }
                    },
                    "required": [
                        "body",
                        "contentType"
                    ]
                },
                "dtype": {
                    "type": "string",
                    "description": "type of data PCM data that's sent to the inference server as raw array",
                    "enum": [
                        "uint8",
                        "float32",
                        "float64"
                    ]
                }
            },
            "required": [
                "audio"
            ]
        },
        {
            "properties": {
                "audio": {
                    "type": "string",
                    "description": "base64 encoded audio data"
                },
                "dtype": {
                    "type": "string",
                    "description": "type of data PCM data that's sent to the inference server as raw array",
                    "enum": [
                        "uint8",
                        "float32",
                        "float64"
                    ]
                }
            },
            "required": [
                "audio"
            ]
        }
    ]
}
Output
{
    "type": "object",
    "contentType": "application/json",
    "properties": {
        "is_complete": {
            "type": "boolean",
            "description": "if true, end-of-turn was detected"
        },
        "probability": {
            "type": "number",
            "description": "probability of the end-of-turn detection"
        }
    }
}
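The `oneOf` keyword in the input schema means a payload must match exactly one of the two variants. The sketch below is a hand-written approximation of that rule, not a full JSON Schema validator. The function names are hypothetical, and the stream variant's `body` is modeled as a plain dictionary purely for illustration.

```python
# Values the input schema allows for `dtype`.
ALLOWED_DTYPES = {"uint8", "float32", "float64"}


def _dtype_ok(payload: dict) -> bool:
    # `dtype` is optional; when present it must be one of the enum strings.
    if "dtype" not in payload:
        return True
    value = payload["dtype"]
    return isinstance(value, str) and value in ALLOWED_DTYPES


def _matches_stream_variant(payload: dict) -> bool:
    # Variant 0: `audio` is an object with required `body` and `contentType`.
    audio = payload.get("audio")
    return (
        isinstance(audio, dict)
        and isinstance(audio.get("body"), dict)
        and isinstance(audio.get("contentType"), str)
        and _dtype_ok(payload)
    )


def _matches_base64_variant(payload: dict) -> bool:
    # Variant 1: `audio` is a base64 string.
    return isinstance(payload.get("audio"), str) and _dtype_ok(payload)


def match_input_variant(payload: dict) -> int:
    """Return the index of the single matching variant, mirroring `oneOf`."""
    checks = (_matches_stream_variant, _matches_base64_variant)
    matches = [index for index, check in enumerate(checks) if check(payload)]
    if len(matches) != 1:
        raise ValueError("payload must match exactly one input variant")
    return matches[0]
```

Because `audio` cannot be both an object and a string, a well-formed payload matches at most one variant, so the error path covers payloads that match neither.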