

SAM Audio
Like
SAM-Audio is a foundation model for isolating any sound in audio using text, visual, or temporal prompts. It can separate specific sounds from complex audio mixtures based on natural language descriptions, visual cues from video, or time spans.
Cost / License
- Free
- Open Source
Platforms
- Online
- Self-Hosted
- Python
Features
No features, maybe you want to suggest one?
Tags
- ai-model
- vocal-isolation
- music-separation
- Audio Processing
- foundation-models
SAM Audio News & Activities
Highlights All activities
Recent News
- POX published news article about SAM Audio
Meta launches SAM Audio, an AI model for intuitive sound segmentation and isolationMeta has launched SAM Audio, a state-of-the-art artificial intelligence model that brings advanced ...
Recent activities
SAM Audio information
No comments or reviews, maybe you want to be first?
Post comment/reviewWhat is SAM Audio?
SAM-Audio is a foundation model for isolating any sound in audio using text, visual, or temporal prompts. It can separate specific sounds from complex audio mixtures based on natural language descriptions, visual cues from video, or time spans.
SAM-Audio supports three types of prompting: text, visual, and span. Each method allows you to specify which sounds to isolate in different ways.
- Text Prompting: Use natural language descriptions to isolate sounds.
- Visual Prompting: Isolate sounds associated with specific visual objects in a video using masked video frames.
- Span Prompting (Temporal Anchors): Specify time ranges where the target sound occurs or doesn't occur. This provides a specific example to the model of what to isolate.








