ProsodyConfig
Bases: ModelConfigBase['ProsodyConfig']
Configuration for the speech prosody model.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
granularity | Optional[str] | The granularity at which to generate predictions. Accepted values are | None |
identify_speakers | Optional[bool] | Whether to return identifiers for speakers over time. If true, unique identifiers will be assigned to spoken words to differentiate different speakers. If false, all speakers will be tagged with an "unknown" ID. This configuration is only available for the batch API. | None |
window | Optional[Dict[str, float]] | Sliding window used to chunk audio. This dictionary input takes two entries: | None |
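
The three parameters above are all optional and default to None. As a rough illustration of that shape, the following sketch uses a hypothetical stand-in dataclass, `ProsodyConfigSketch`, with a hypothetical `to_dict` helper. It is not the library's implementation and does not import `hume`; it only mirrors the documented names, types, and defaults.

```python
from dataclasses import asdict, dataclass
from typing import Dict, Optional


@dataclass
class ProsodyConfigSketch:
    """Hypothetical stand-in mirroring the three documented parameters."""

    granularity: Optional[str] = None
    identify_speakers: Optional[bool] = None
    window: Optional[Dict[str, float]] = None

    def to_dict(self) -> dict:
        # Hypothetical helper: keep only the parameters that were set explicitly,
        # dropping anything left at its default of None.
        return {key: value for key, value in asdict(self).items() if value is not None}


# Only identify_speakers is set; granularity and window stay at None.
config = ProsodyConfigSketch(identify_speakers=True)
print(config.to_dict())  # prints {'identify_speakers': True}
```

Leaving a parameter at None here simply omits it from the resulting dictionary. Whether the real class serializes its settings this way is an assumption.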
Source code in hume/models/config/prosody_config.py
get_model_type()
classmethod
Get the configuration model type.
Returns:
Name | Type | Description |
---|---|---|
ModelType | ModelType | Model type. |
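
Because `get_model_type()` is a classmethod, it can be called on the class without creating an instance. The sketch below illustrates that calling pattern with hypothetical stand-ins: `ModelTypeSketch`, its `PROSODY` member, and the value `"prosody"` are assumptions made for illustration, not names confirmed by this page.

```python
from enum import Enum


class ModelTypeSketch(Enum):
    """Hypothetical stand-in for the documented ModelType return type."""

    PROSODY = "prosody"  # assumed member name and value


class ProsodyConfigSketch:
    """Hypothetical stand-in for the configuration class."""

    @classmethod
    def get_model_type(cls) -> ModelTypeSketch:
        # The model type is tied to the class itself, so no instance is required.
        return ModelTypeSketch.PROSODY


# Called directly on the class, with no configuration object constructed.
model_type = ProsodyConfigSketch.get_model_type()
print(model_type.name)  # prints PROSODY
```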