SAM Audio icon
SAM Audio icon

SAM Audio

SAM-Audio is a foundation model for isolating any sound in audio using text, visual, or temporal prompts. It can separate specific sounds from complex audio mixtures based on natural language descriptions, visual cues from video, or time spans.

SAM Audio screenshot 1

Cost / License

  • Free
  • Open Source

Platforms

  • Online
  • Self-Hosted
  • Python
-
No reviews
0likes
0comments
0news articles

Features

Suggest and vote on features
No features, maybe you want to suggest one?

 Tags

SAM Audio News & Activities

Highlights All activities

Recent News

Show more news

Recent activities

Show all activities

SAM Audio information

  • Developed by

    US flagMeta
  • Licensing

    Open Source and Free product.
  • Written in

  • Alternatives

    26 alternatives listed
  • Supported Languages

    • English

AlternativeTo Category

AI Tools & Services

GitHub repository

  •  741 Stars
  •  56 Forks
  •  6 Open Issues
  •   Updated  
View on GitHub
SAM Audio was added to AlternativeTo by Paul on and this page was last updated .
No comments or reviews, maybe you want to be first?
Post comment/review

What is SAM Audio?

SAM-Audio is a foundation model for isolating any sound in audio using text, visual, or temporal prompts. It can separate specific sounds from complex audio mixtures based on natural language descriptions, visual cues from video, or time spans.

SAM-Audio supports three types of prompting: text, visual, and span. Each method allows you to specify which sounds to isolate in different ways.

  1. Text Prompting: Use natural language descriptions to isolate sounds.
  2. Visual Prompting: Isolate sounds associated with specific visual objects in a video using masked video frames.
  3. Span Prompting (Temporal Anchors): Specify time ranges where the target sound occurs or doesn't occur. This provides a specific example to the model of what to isolate.

Official Links