What will the false negative rate of the LLM detector 'Binoculars' be in my personal testing, for 100-200 word texts?
What will the false negative rate of the LLM detector 'Binoculars' be in my personal testing, for 100-200 word texts?
2
190Ṁ12k
resolved Jan 24
100%100.0%
0-1%
0.0%
1-5%
0.0%
5-10%
0.0%
10-25%
0.0%
25-50%
0.0%
50-100%

https://huggingface.co/spaces/tomg-group-umd/Binoculars

Over a wide range of document types, Binoculars detects over 90% of generated samples from ChatGPT (and other LLMs) at a false positive rate of 0.01%, despite not being trained on any ChatGPT data

Is there a correlation between Binoculars score and sequence length? Such correlations may create a bias towards incorrect results for certain lengths. In Figure 12, we show the joint distribution of token sequence length and Binocular score. Sequence length offers little information about class membership

However, just by eyeballing the graph in figure 12 the false positive rate will probably be somewhat higher for 100-200 word texts. I'm using that length because that's the rough length of text I'd use the tool on, though.

I'll run it on at least 100 pieces of gpt4-generated text that I generate, and at least 100 pieces of non-llm text from around the web that I browse. Both generations and human text will be the kinds of things I'd usually generate or read myself, which may have different properties than what the paper used.

Get
Ṁ1,000
to start trading!

🏅 Top traders

#NameTotal profit
1Ṁ134
2Ṁ54

What is this?

What is Manifold?
Manifold is the world's largest social prediction market.
Get accurate real-time odds on politics, tech, sports, and more.
Or create your own play-money betting market on any question you care about.
Are our predictions accurate?
Yes! Manifold is very well calibrated, with forecasts on average within 4 percentage points of the true probability. Our probabilities are created by users buying and selling shares of a market.
In the 2022 US midterm elections, we outperformed all other prediction market platforms and were in line with FiveThirtyEight’s performance. Many people who don't like betting still use Manifold to get reliable news.
ṀWhy use play money?
Mana (Ṁ) is the play-money currency used to bet on Manifold. It cannot be converted to cash. All users start with Ṁ1,000 for free.
Play money means it's much easier for anyone anywhere in the world to get started and try out forecasting without any risk. It also means there's more freedom to create and bet on any type of question.
© Manifold Markets, Inc.Terms + Mana-only TermsPrivacyRules