
Monthly TRP Graphs and the BS we get from our overlords

Quick Links to Monthly TRP Results

November 11, 2025 | October 14, 2025 | September 16, 2025 | August 19, 2025 | July 22, 2025 | June 24, 2025 | May 27, 2025 | April 29, 2025


November Factuality and Severity TRP Graphs

Results from November 11, 2025

User Query | Rating (INA, UNS, DIS, ACC, CCA, NCP)
Julie Bowen Movies/TV Shows | Accurate
Tom Welling | Accurate
KFC Buffet Utah | Inaccurate
Hank Green Pronouns | Accurate
Cast of Cyber Tracker 2 | Accurate
User Query ACC Low Low+ Med Med+ High ITA
Crab Legs on Sale Filled
VR Arcades in Fort Wayne, IN Filled
Choctaw Casinos & Resorts Filled
Nearest Post Office in WI Filled
The Terrible Trio Filled
Anker Power Bank Filled Filled

October Factuality and Severity TRP Graphs

Results from October 14, 2025

While we're all aware that the Wayback Machine is mentioned in GG as a reputable place to verify information, it's apparently the end-all as well. The query for Mexican Restaurants in Detroit had a response date of March 27, 2025, and since the menu for Evie's Tamales in Detroit wasn't available on the Wayback Machine, the claim was rated CCA. It's almost like this was a trick to intentionally screw with us.

User Query | Rating (INA, UNS, DIS, ACC, CCA, NCP)
Does Liberty Tax Offer Tax Preparation Services | Inaccurate
Show me the Burger King | Inaccurate
Looking for Campgrounds in New Hampshire | Accurate
Mexican Restaurants in Detroit | Can't Confidently Assess
Little Caesars, McDonald's and Valero in Cairo, GA | Inaccurate
Bread Bakeries in or Near Little Chute, WI | Accurate
User Query ACC Low Low+ Med Med+ High ITA
Heaven Official's Blessing Season 3 Filled
Poster American Horror Story Coven Filled Filled
Greek Restaurants in Chicago Filled Filled

September Factuality and Severity TRP Graphs

Results from September 16, 2025

User Query | Rating (INA, UNS, DIS, ACC, CCA, NCP)
Casual Dining Restaurants Near me in Kansas City, MO | Accurate
Is Harry Potter's Son a Squib | Accurate
Can Babies Get Utis | Accurate
Too Much Elderberry Syrup Cause Constipation | Accurate
Ski trip near Binghamton, NY | Inaccurate
Odd Number Without e | Inaccurate
Oregon Law False Claim | Inaccurate
User Query ACC Low Low+ Med Med+ High ITA
What are the Top-Rated Dental Clinics in Miami Filled
Best Things to Visit on the Amalfi Coast Filled Filled
Fight or Flight Sequel Filled Filled Filled
Tokyo Car Show June 2025 Filled

August Factuality and Severity TRP Graphs

Results from August 19, 2025

User Query | Rating (INA, UNS, DIS, ACC, CCA, NCP)
Vice President of the United States | Inaccurate
AT&T Stores in Eugene, Springfield | Accurate
Find Irish Pubs Near 45th Street in New York City | Accurate
Show Me Tattoo Shops in Kansas City | Inaccurate
3-month Shot for Bipolar | Inaccurate
I Need Ideas for a Last-Minute Romantic Weekend Getaway from San Francisco | Inaccurate
User Query ACC Low Low+ Med Med+ High ITA
Cheap Daycare Near Me Filled
Compare Disney's Operating Income with US Pop Change Filled Filled
COPA America 2024 Cities Filled Filled
Does Cinebistro in Miami Offer Reclining Seats Filled Filled
Late-night desserts in Los Angeles Filled Filled
How Long Does it Take to put Stent in Kidney Filled Filled
Vladtv Wife Filled

July Factuality and Severity TRP Graphs

Results from July 22, 2025

Let's get to it, eh. On the Chinese restaurant task we learned not to be literal and to use the context in the response, even though the target sentence is clearly wrong. There is a complete lack of consistency, kind of like Mamdani changing his views on a whim. And we learned that Indiana and Kentucky are the same state, innit? There was no hedging in the Kentucky severity task, but they still accepted the Ohio Falls State Park as a good, reasonable suggestion due to its proximity to Kentucky. Yet a few months ago, with the HP Printer severity, they absolutely refused to consider 27.78 and 27.8 as the same weight. There is no consistency in their ratings on these.

The other task which doesn't make sense is Simone Biles. We were told this is a clear-cut inaccuracy, yet the rating only included the low ranges. The medal count that was given was completely wrong and stale, yet they are ignoring this fact. However, they are not ignoring the part about awards in the AMA Awards severity; there it was considered a strict high. How is this more harmful than the stale and inaccurate medal count for Simone?

User Query | Rating (INA, UNS, DIS, ACC, CCA, NCP)
Food Assistance Resources, Frankfort, Il | Inaccurate
Chinese Restaurants Within 1 Mile of Oakmont | Accurate
Daily Iron Intake for Men | Inaccurate
Relaxing Getaway to Long Beach Island, NJ | No Claims
Fine Dining Experience in New York City | Accurate
Zanzibar Restaurant or Bar in Denver, CO | Inaccurate
Sherburne County Dispatch Non-Emergency Number | Inaccurate
User Query ACC Low Low+ Med Med+ High ITA
Things To Do in Kentucky Filled
How Much is Parking at Dulles Airport Filled
Compare Costco and Walmart Stock Price Filled Filled
What is PL 94–142 Mean Filled Filled Filled
Most Awarded Winners at the AMA Filled
Huntington Bank on Triskett Rd in Cleveland Open Filled
Medals has Simone Biles Won in Gymnastics Filled Filled
Top-Rated Dental Clinics in Miami Filled

June Factuality and Severity TRP Graphs

Results from June 24, 2025

They cannot give us consistent guidance on context. This month the problem task was [rp diet coach app cost]. The evidence specifically said "Monthly Subscription: $14.99 or $19.99 per month." This came directly under this header: "According to Sensor Tower, the app's subscription costs are." So one would think you just check the Sensor Tower website and verify, but no, that is not correct. This is what we were told: "Sensor Tower lists several different prices though it is unclear if these are current costs or previous promotional deals. We cannot tell from the website if both prices are currently valid and if they are, what factors affect which price a user might pay. However, the official site for the app does not mention a $14.99 monthly subscription fee. Only the $19.99 subscription fee is mentioned. The claim is marked as inaccurate due to misleading information." So now context isn't enough; we have to verify the context information too.

User Query | Rating (INA, UNS, DIS, ACC, CCA, NCP)
Long Island Restaurant Week | Inaccurate
The Proud Family Season 3 | Inaccurate
Campgrounds New Hampshire | Accurate
Move to Anderson, CA | Accurate
Delta Million Mile Gift | Inaccurate
SSA Office in Millington, TN | Accurate
Walk-in Vet Clinic | No Claims
User Query ACC Low Low+ Med Med+ High ITA
Jumpz Trampoline Filled
AAA Claims Filled
Translate a Meeting in Google Meet Filled Filled Filled
Face Moisturizer for Night Time Filled Filled Filled
RP Diet Coach Filled Filled Filled
Hojicha vs Matcha Filled Filled Filled

May Factuality TRP Graph

Results from May 27, 2025

User Query | Rating (INA, UNS, DIS, ACC, CCA, NCP)
Events Near Me This Weekend | Inaccurate
Recs for Bars in Statesville, NC | Unsupported
Restaurant Called Grand Grill | Inaccurate
Fabian Frankel | Accurate
Was Desantis Born Rich | Inaccurate
Tailor and Home Depot in Houston | No Claims
Meaning of Grandala in Oxford Dictionary | Accurate
Highly Rated Halal, NYC | No Claims

April Factuality and Severity TRP Graphs

Results from April 29, 2025

The insanity for this month is the email distribution list fed by a Google Sheet. We were told the inaccuracy here is somewhat related to primary user intent. There is a mitigating nuance and context when reading the full instructions for duplicate management in Google Contacts: it is done automatically after you select the option to do it. This makes the inaccuracy of the claim in context less severe, which results in a severity rating of low, so this is a strict low. What regular person is going to care about that "mitigating nuance"? It's a perfect example of how we don't stand a chance of doing well on these subjective tasks.

User Query | Rating (INA, UNS, DIS, ACC, CCA, NCP)
Disposing of a Deceased Pet | Inaccurate
Restaurant recommendations for Dinner, AL | No Claims
Find Restaurants in Jacksonville | Accurate
Planning a Vacation in Dallas | Inaccurate
Recommendations for Pizza in NYC | Inaccurate
Dallas, AR, have highly rated spots for tiramisu? | Inaccurate
User Query ACC Low Low+ Med Med+ High ITA
Create email list fed by Google Filled
Top Credit Unions Filled Filled Filled
What is there to do in Milledgeville Filled Filled
Samsung 32 in. Odyssey Neo G8 Filled Filled Filled
Maple Grove Farm Filled
Quest Labs Eugene Filled