Toxic Prompt RoBERTa

A text classification model that can be used as a guardrail to protect against toxic prompts and responses in conversational AI systems.

Cost / License

  • Free
  • Open Source

Platforms

  • Self-Hosted
-
No reviews
0likes
0comments
0news articles

Features

Suggest and vote on features
  1.  AI-Powered

 Tags

  • safeguarding
  • ai-safety
  • safety
  • safety-management
  • ai-guardrails
  • huggingface

Toxic Prompt RoBERTa News & Activities

Highlights All activities

Recent activities

Show all activities

Toxic Prompt RoBERTa information

  • Developed by

    US flagIntel
  • Licensing

    Open Source (MIT) and Free product.
  • Alternatives

    3 alternatives listed
  • Supported Languages

    • English

AlternativeTo Category

AI Tools & Services
Toxic Prompt RoBERTa was added to AlternativeTo by Paul on and this page was last updated .
No comments or reviews, maybe you want to be first?
Post comment/review

What is Toxic Prompt RoBERTa?

Toxic Prompt RoBERTa 1.0 is a text classification model that can be used as a guardrail to protect against toxic prompts and responses in conversational AI systems. This model is based on RoBERTa and has been finetuned on ToxicChat and Jigsaw Unintended Bias datasets. Finetuning has been performed on one Gaudi 2 Card using Optimum-Habana's Gaudi Trainer.

Official Links