roberta-base-openai-detector

2023-12-26 13:47, 微浪网

RoBERTa Base OpenAI Detector

Table of Contents

Model Details

Uses

Risks, Limitations and Biases

Training

Evaluation

Environmental Impact

Technical Specifications

Citation Information

Model Card Authors

How to Get Started with the Model

Model Details

Model Description: RoBERTa base OpenAI Detector is the GPT-2 output detector model, obtained by fine-tuning a RoBERTa base model with the outputs of the 1.5B-parameter GPT-2 model. The model can be used to predict if text was generated by a GPT-2 model. This model was released by OpenAI at the same time as OpenAI released the weights of the largest GPT-2 model, the 1.5B parameter version.

Developed by: OpenAI, see GitHub Repo and associated paper for full author list

Model Type: Fine-tuned transformer-based language model

Language(s): English

License: MIT

Related Models: RoBERTa base, GPT-2 XL (the 1.5B parameter version), GPT-2 Large (the 774M parameter version), GPT-2 Medium (the 355M parameter version) and GPT-2 (the 124M parameter version)

Resources for more information:

Research Paper (see, in particular, the section beginning on page 12 about Automated ML-based detection).

GitHub Repo

OpenAI Blog Post

Explore the detector model here

Uses

Direct Use

The model is a classifier that can be used to detect text generated by GPT-2 models. However, using it as a ChatGPT detector is strongly discouraged, particularly as a basis for serious allegations of academic misconduct against undergraduates and others: the model was fine-tuned on GPT-2 outputs and might give inaccurate results on ChatGPT-generated input.

Downstream Use

The model’s developers have stated that they developed and released the model to help with research related to synthetic text generation, so the model could potentially be used for downstream tasks related to synthetic text generation. See the associated paper for further discussion.

Misuse and Out-of-scope Use

The model should not be used to intentionally create hostile or alienating environments for people. In addition, the model developers discuss the risk of adversaries using the model to better evade detection in their associated paper, suggesting that using the model for evading detection or for supporting efforts to evade detection would be a misuse of the model.

Risks, Limitations and Biases

CONTENT WARNING: Readers should be aware this section may contain content that is disturbing, offensive, and can propagate historical and current stereotypes.
Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model.

Risks and Limitations

In their associated paper, the model developers discuss the risk that the model may be used by bad actors to develop capabilities for evading detection, though one purpose of releasing the model is to help improve detection research.
In a related blog post, the model developers also discuss the limitations of automated methods for detecting synthetic text and the need to pair automated detection tools with other, non-automated approaches. They write:

We conducted in-house detection research and developed a detection model that has detection rates of ~95% for detecting 1.5B GPT-2-generated text. We believe this is not high enough accuracy for standalone detection and needs to be paired with metadata-based approaches, human judgment, and public education to be more effective.

The model developers also report finding that classifying content from larger models is more difficult, suggesting that detection with automated tools like this model will be increasingly difficult as model sizes increase. The authors find that training detector models on the outputs of larger models can improve accuracy and robustness.
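To see why a detection rate of about 95% may fall short for standalone use, the following minimal sketch applies Bayes' rule to estimate what share of flagged texts would actually be machine-generated. The 5% false-positive rate and the prevalence values are illustrative assumptions, not figures reported by the developers, and `flagged_precision` is a hypothetical helper name.

```python
def flagged_precision(prevalence, detection_rate=0.95, false_positive_rate=0.05):
    """Share of flagged texts that are truly machine-generated (Bayes' rule).

    prevalence: assumed fraction of all texts that are machine-generated.
    detection_rate: fraction of generated texts the detector flags (~95% per the blog post).
    false_positive_rate: assumed fraction of human-written texts flagged by mistake.
    """
    true_positives = prevalence * detection_rate
    false_positives = (1.0 - prevalence) * false_positive_rate
    return true_positives / (true_positives + false_positives)


# Hypothetical prevalence values, chosen only to show the trend.
for prevalence in (0.5, 0.1, 0.01):
    share = flagged_precision(prevalence)
    print(f"prevalence={prevalence:.2f} -> share of flags that are correct={share:.3f}")
```

Under these assumptions, when only 1% of the texts are machine-generated, roughly 16% of the flagged texts are true detections, which is one way to read the developers' advice to pair the detector with human judgment and metadata.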

Bias

Significant research has explored bias and fairness issues with language models (see, e.g., Sheng et al. (2021) and Bender et al. (2021)). Predictions generated by RoBERTa base and GPT-2 1.5B (which this model is built/fine-tuned on) can include disturbing and harmful stereotypes across protected classes; identity characteristics; and sensitive, social, and occupational groups (see the RoBERTa base and GPT-2 XL model cards for more information). The developers of this model discuss these issues further in their paper.

Training

Training Data

The model is a sequence classifier based on RoBERTa base (see the RoBERTa base model card for more details on the RoBERTa base training data) that was then fine-tuned using the outputs of the 1.5B GPT-2 model (available here).

Training Procedure

The model developers write that:

We based a sequence classifier on RoBERTaBASE (125 million parameters) and fine-tuned it to classify the outputs from the 1.5B GPT-2 model versus WebText, the dataset we used to train the GPT-2 model.

They later state:

To develop a robust detector model that can accurately classify generated texts regardless of the sampling method, we performed an analysis of the model’s transfer performance.

See the associated paper for further details on the training procedure.
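As a rough illustration of the binary setup described above (generated text versus WebText), the sketch below builds a shuffled list of labelled examples. It is not the developers' training code: the label ids, the helper name `build_examples`, and the sample sentences are all assumptions made for this example.

```python
import random

# Label ids are an assumption for this sketch, not taken from the paper.
GENERATED, HUMAN = 0, 1


def build_examples(human_texts, generated_texts, seed=0):
    """Pair each text with a class label and shuffle the combined list."""
    examples = [(text, HUMAN) for text in human_texts]
    examples += [(text, GENERATED) for text in generated_texts]
    random.Random(seed).shuffle(examples)  # fixed seed keeps the order reproducible
    return examples


# Placeholder sentences standing in for WebText documents and GPT-2 samples.
examples = build_examples(
    human_texts=["A paragraph scraped from the web."],
    generated_texts=["A paragraph sampled from a language model."],
)
print(examples)
```

A list like this could then be tokenized and passed to a sequence-classification fine-tuning loop.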

Evaluation

The following evaluation information is extracted from the associated paper.

Testing Data, Factors and Metrics

The model is intended to be used for detecting text generated by GPT-2 models, so the model developers test the model on text datasets, measuring accuracy by:

testing 510-token test examples comprised of 5,000 samples from the WebText dataset and 5,000 samples generated by a GPT-2 model, which were not used during the training.
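The accuracy measurement on a balanced test set like the one described above can be sketched as follows. The toy labels and predictions are made up for illustration; only the sample count of 10,000 (5,000 + 5,000) comes from the text, and the standard-error formula assumes independent test examples.

```python
import math


def accuracy(labels, predictions):
    """Fraction of predictions that match the reference labels."""
    if len(labels) != len(predictions):
        raise ValueError("labels and predictions must have the same length")
    correct = sum(1 for y, p in zip(labels, predictions) if y == p)
    return correct / len(labels)


def accuracy_standard_error(acc, n):
    """Binomial standard error of an accuracy estimate over n independent examples."""
    return math.sqrt(acc * (1.0 - acc) / n)


# Toy data: four human-written and four generated texts, with two mistakes.
labels = ["Real"] * 4 + ["Fake"] * 4
predictions = ["Real", "Real", "Real", "Fake", "Fake", "Fake", "Fake", "Real"]
print(accuracy(labels, predictions))

# Uncertainty of a 95% accuracy estimate on 5,000 + 5,000 test examples.
print(accuracy_standard_error(0.95, 10_000))
```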

Results

The model developers find:

Our classifier is able to detect 1.5 billion parameter GPT-2-generated text with approximately 95% accuracy… The model’s accuracy depends on sampling methods used when generating outputs, like temperature, Top-K, and nucleus sampling (Holtzman et al., 2019). Nucleus sampling outputs proved most difficult to correctly classify, but a detector trained using nucleus sampling transfers well across other sampling methods. As seen in Figure 1 [in the paper], we found consistently high accuracy when trained on nucleus sampling.

See the associated paper, Figure 1 (on page 14) and Figure 2 (on page 16) for full results.
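For readers unfamiliar with the sampling methods named in these results, the sketch below shows how temperature, Top-K, and nucleus (top-p) sampling each reshape a next-token distribution before a token is drawn. It is a simplified, pure-Python illustration with hypothetical function names and a toy four-token vocabulary, not the generation code used for GPT-2.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Convert logits to probabilities; lower temperature sharpens the distribution."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def _renormalize(probs, keep):
    """Zero out tokens outside `keep` and rescale the rest to sum to 1."""
    kept = set(keep)
    total = sum(probs[i] for i in kept)
    return [probs[i] / total if i in kept else 0.0 for i in range(len(probs))]


def top_k_filter(probs, k):
    """Keep only the k most probable tokens."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return _renormalize(probs, order[:k])


def nucleus_filter(probs, top_p):
    """Keep the smallest set of top tokens whose cumulative probability reaches top_p."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    keep, cumulative = [], 0.0
    for i in order:
        keep.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    return _renormalize(probs, keep)


# Toy distribution over a four-token vocabulary.
probs = [0.5, 0.3, 0.15, 0.05]
print(top_k_filter(probs, k=2))
print(nucleus_filter(probs, top_p=0.9))
print(softmax_with_temperature([2.0, 1.0, 0.0], temperature=0.7))
```

Top-K always keeps a fixed number of tokens, whereas the nucleus keeps a variable number depending on how concentrated the distribution is.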

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

Hardware Type: Unknown

Hours used: Unknown

Cloud Provider: Unknown

Compute Region: Unknown

Carbon Emitted: Unknown
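Because every field above is listed as unknown, no emissions figure can be derived for this model. The sketch below only illustrates the general shape of such an estimate (energy used multiplied by the carbon intensity of the electricity), using a simplified formula and made-up placeholder inputs; the function name, the optional `pue` overhead factor, and all numbers are assumptions.

```python
def estimate_emissions_kg(power_watts, hours, kg_co2_per_kwh, pue=1.0):
    """Rough CO2-equivalent estimate in kilograms.

    power_watts: average hardware power draw.
    hours: total training time.
    kg_co2_per_kwh: carbon intensity of the electricity in the compute region.
    pue: data-center overhead factor (1.0 means no overhead); an optional assumption.
    """
    energy_kwh = power_watts / 1000.0 * hours * pue
    return energy_kwh * kg_co2_per_kwh


# Placeholder values only; the real hardware, hours, and region are unknown.
print(estimate_emissions_kg(power_watts=300, hours=10, kg_co2_per_kwh=0.4))
```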

Technical Specifications

See the associated paper for details on the model architecture and training.

Citation Information

@article{solaiman2019release,
  title={Release strategies and the social impacts of language models},
  author={Solaiman, Irene and Brundage, Miles and Clark, Jack and Askell, Amanda and Herbert-Voss, Ariel and Wu, Jeff and Radford, Alec and Krueger, Gretchen and Kim, Jong Wook and Kreps, Sarah and others},
  journal={arXiv preprint arXiv:1908.09203},
  year={2019}
}

APA:

Solaiman, I., Brundage, M., Clark, J., Askell, A., Herbert-Voss, A., Wu, J., … & Wang, J. (2019). Release strategies and the social impacts of language models. arXiv preprint arXiv:1908.09203.

Model Card Authors

This model card was written by the team at Hugging Face.

How to Get Started with the Model

This model can be instantiated and run with a Transformers pipeline:
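from transformers import pipeline

# Load the detector as a text-classification pipeline
pipe = pipeline("text-classification", model="roberta-base-openai-detector")

# Classify a single string; the result is a list with one label/score dict
print(pipe("Hello world! Is this content AI-generated?"))
# [{'label': 'Real', 'score': 0.8036582469940186}]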
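The pipeline returns a list of dictionaries, each with a `label` and a `score` for that label. As a hedged sketch of how such results might be post-processed, the helpers below convert a result into a probability that the text was generated and flag only high-confidence cases for human review. The function names, the 0.9 threshold, and the `'Fake'` label in the mock data are assumptions for illustration; only the `'Real'` result is taken from the example output above, and the conversion assumes a two-class classifier.

```python
def generated_probability(result, real_label="Real"):
    """Probability that the text is machine-generated, assuming two classes."""
    score = result["score"]
    # The score belongs to the predicted label, so invert it for 'Real' predictions.
    return 1.0 - score if result["label"] == real_label else score


def triage(results, review_threshold=0.9):
    """Flag a text for human review only when the generated probability is high."""
    return [
        "needs human review"
        if generated_probability(r) >= review_threshold
        else "no flag"
        for r in results
    ]


# Mock pipeline outputs: the first is the example result, the second is hypothetical.
results = [
    {"label": "Real", "score": 0.8036582469940186},
    {"label": "Fake", "score": 0.97},
]
print([round(generated_probability(r), 3) for r in results])
print(triage(results))
```

Routing flagged texts to a reviewer rather than acting on the label directly follows the card's guidance that the detector should not be used on its own.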
