Toxic Prompt RoBERTa

A text classification model that can be used as a guardrail to protect against toxic prompts and responses in conversational AI systems.

Cost / License

Free
Open Source (MIT)

Application type

Large Language Model (LLM) Tool

Origin

United States

Platforms

Self-Hosted

Features

AI-Powered

Toxic Prompt RoBERTa information

Developed by: Intel
Licensing: Open source (MIT), free
Alternatives: 3 alternatives listed
Supported Languages: English

AlternativeTo Category

AI Tools & Services

Toxic Prompt RoBERTa was added to AlternativeTo by Paul on Mar 12, 2025 and this page was last updated Mar 12, 2025.

What is Toxic Prompt RoBERTa?

Toxic Prompt RoBERTa 1.0 is a text classification model that can serve as a guardrail against toxic prompts and responses in conversational AI systems. It is based on RoBERTa and was fine-tuned on the ToxicChat and Jigsaw Unintended Bias datasets. Fine-tuning was performed on a single Gaudi 2 card using Optimum-Habana's Gaudi Trainer.
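
As a rough illustration of the guardrail idea described above, the sketch below checks both the user prompt and the generated response before anything is returned. It is a minimal sketch under stated assumptions, not the developer's reference usage: the Hugging Face model id `Intel/toxic-prompt-roberta`, the label name `toxic`, the 0.5 threshold, and the helper names `is_toxic`, `guarded_reply`, and `load_classifier` are all assumptions made for illustration. The classifier is passed in as a callable, so any function returning a list of `{"label", "score"}` dictionaries can be used.

```python
from typing import Any, Callable, Dict, List

# A classifier maps a text to a list of {"label": str, "score": float} dicts,
# which is the shape returned by a transformers text-classification pipeline.
Classifier = Callable[[str], List[Dict[str, Any]]]


def is_toxic(text: str, classify: Classifier,
             toxic_label: str = "toxic", threshold: float = 0.5) -> bool:
    """Return True if any prediction carries the toxic label at or above threshold.

    The label name and threshold are assumptions; check the model card for
    the labels the model actually emits.
    """
    for prediction in classify(text):
        label = str(prediction["label"]).lower()
        if label == toxic_label and float(prediction["score"]) >= threshold:
            return True
    return False


def guarded_reply(prompt: str, generate: Callable[[str], str],
                  classify: Classifier,
                  refusal: str = "Sorry, I can't help with that.") -> str:
    """Screen the prompt, generate a reply, then screen the reply."""
    if is_toxic(prompt, classify):
        return refusal
    reply = generate(prompt)
    if is_toxic(reply, classify):
        return refusal
    return reply


def load_classifier(model_id: str = "Intel/toxic-prompt-roberta") -> Classifier:
    """Build a classifier from a Hugging Face pipeline.

    The model id is an assumption; transformers is imported lazily so the
    guardrail logic above can be used and tested without it.
    """
    from transformers import pipeline
    return pipeline("text-classification", model=model_id)
```

In use, `guarded_reply(user_prompt, my_llm, load_classifier())` would wrap a hypothetical `my_llm` generation function. Screening both directions reflects the description's mention of toxic prompts and toxic responses.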
