Community Archive


🧵 Thread (24 tweets)

George@georgejrjrjr• 9 months ago

> GPT model stopped speaking Croatian > Nobody could figure out why. Turns out > Croatian users were much more prone to downvote messages https://t.co/NHgqUJXAOy

14.3K 1.1K

4/30/2025

Ashita Orbis@AshitaOrbis• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr I've always wondered if anyone pressed the upvote or downvote buttons on ChatGPT, and if they did anything. This is a surprising answer to both

43 0

4/30/2025

konzuko@konzuko• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr it was after i read the llama3 paper that i learnt just how janky model training is. reading it changed my life

167 2

5/1/2025

George@georgejrjrjr• 9 months ago

Replying to @konzuko

@konzuko Then I suspect you’ll enjoy this.

Nathan Lambert@natolambert• about 1 year ago

As the only AI lab that can share 100% of the details of training language models, at @allen_ai we're really kind of obligated to share more on how it works (and what doesn't). Here's a reflection with @mechanicaldirk @kylelostat & @soldni on OLMo 2 and what comes next!
00:00:00 Introduction
00:02:45 Early history of the OLMo project
00:15:27 The journey to stability
00:25:00 The evolving role of OLMo and pretraining research
00:29:00 Pretraining Q&A (µP, scaling laws, MoE, etc.)
00:40:40 How to think about pretraining data work
00:54:30 Role of pre-training vs mid training vs post-training
01:02:19 Release strategy and wrapping up
Links below.

417 80

74 4

5/1/2025

Jake@jakub864• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr That doesn't make sense, wouldn't English be downvoted more than Croatian by Croatian users?

12 0

5/1/2025

George@georgejrjrjr• 9 months ago

Replying to @jakub864

Object level: Croats are presumably the lion’s share of users speaking Croatian with the model. Whereas they are a drop in the bucket for English. Meta-level: I unironically appreciate your earnest engagement with my shitpost. …in part because there’s a Sapir-Whorf angle I hadn’t thought to consider: do polyglots’ apparent preferences differ based on the language they’re speaking? I have no idea and I wonder if this has ever been studied.

115 1

5/1/2025
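The object-level point above is a base-rate argument, and a few lines of arithmetic make it concrete. This is a minimal sketch with invented numbers: the vote counts and downvote probabilities below are hypothetical, not figures from OpenAI or from the thread. It only shows that a group that dominates one language's feedback sets that language's downvote rate, while barely moving the rate for a language where it is a tiny minority.

```python
def downvote_rate(groups):
    """Blended downvote rate for one language.

    groups: list of (num_votes, downvote_probability) pairs,
    one pair per user population voting in that language.
    """
    total_votes = sum(n for n, _ in groups)
    return sum(n * p for n, p in groups) / total_votes


# Hypothetical populations: (votes cast, probability a vote is a downvote).
# Assumption: Croatian users downvote 30% of the time, everyone else 10%.
croatian_language = [(950, 0.30), (50, 0.10)]            # mostly Croatian users
english_language = [(1_000, 0.30), (999_000, 0.10)]      # Croatian users are a sliver

croatian_rate = downvote_rate(croatian_language)
english_rate = downvote_rate(english_language)
print(f"Croatian: {croatian_rate:.4f}, English: {english_rate:.4f}")
```

Under these made-up inputs the same users, with the same habits, push the Croatian-language rate close to their own while leaving the English-language rate almost untouched.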

Chris@knowclarified• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr Link to article?

4 0

5/1/2025

George@georgejrjrjr• 9 months ago

Replying to @knowclarified

@knowclarified No can do, sadly. Will deleted it.

George@georgejrjrjr• 9 months ago

@gwern Affirmative. Relatedly, if anyone captured the rest of it I’d like a copy. (DMs open + Signal in bio).

25 1

9 0

5/1/2025

Shubham Arora@shubham_arora_0• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr the Eastern European stereotype holds 😂

5 0

5/1/2025

Pale Blue Mediocrity@MatveyShishov• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr Hilarious! So ChatGPT wasn't artificially overhyped maybe? Just an effect of mostly talking with American users?

110 1

5/1/2025

George@georgejrjrjr• 9 months ago

Replying to @MatveyShishov

@MatveyShishov which tho? america is a mosaic wrt glazing norms.

25 1

5/1/2025

Lazar Stojković ⚡️@LazarStojkovic• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr Croatia is in Southern, not Eastern Europe. https://t.co/YYHnnbgRNd https://t.co/6kZm5X8fnb

39 3

5/1/2025

George@georgejrjrjr• 9 months ago

Replying to @LazarStojkovic

@LazarStojkovic depends on who you ask https://t.co/57RXcB4Z9j

116 0

5/1/2025

︎@8lackPrince• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr > GPT gave up speaking croatia https://t.co/rmsvxIlBhJ

59 0

5/1/2025

George@georgejrjrjr• 9 months ago

Replying to @georgejrjrjr

🙏 to the anon who captured and dm’d the second half of the tweet. https://t.co/T3XGRodgP7

326 9

5/1/2025

Loweren@See_Elegance• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr What I don't get is: why does OpenAI have such an issue writing prize-winning prose and personalising their text output to the user, but Midjourney has no problem making prize-winning and personalized images? Why is text harder to optimize than photos?

1 0

5/1/2025

sdmat@sdmat123• 9 months ago

Replying to @georgejrjrjr

I don't see the distinction between things people don't like that they favor in their revealed preferences and optimization failures. Surely the failures he cites for the latter stem from the objective and data rather than the optimization/learning process itself. I have a revealed preference for delicious, unhealthy fast food. If I hire a chef to make healthy meals and they follow my revealed preference rather than expressed preference that would be a problem with their objective and not a failure to optimize.

4 0

5/1/2025

Conor@jconorgrogan• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr total american cultural AI victory, this is how we win https://t.co/OSc6ZQry3M

6 0

5/1/2025

Pierre Bongrand@bongrandp• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr what do you think @Teknium1 ? is post training this tricky/sensitive? did OpenAI mess up by pushing this model to 800M users without enough testing? or a bit of both?

15 0

5/1/2025

George@georgejrjrjr• 9 months ago

Replying to @bongrandp

@bongrandp @Teknium1 sft maximalists don’t have reward model problems

13 0

5/1/2025

king midas 霊@breath_mirror• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr it also started speaking spanish like argentinians (a subset of spanish speakers who use “vos” rather than “tú” for the word “you”, plus different conjugations). no one knows why either @willdepue

5 0

5/1/2025

Bart Stewart@flatfingers0• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr This has been floating around for some years now, in the same spirit: https://t.co/0mdGIwIxMj

46 4

5/1/2025

Pascal-Emmanuel Gobry@pegobry_en• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr @FistedFoucault

6 0

5/1/2025

Andrew Clough@aclough• 9 months ago

Replying to @georgejrjrjr

@georgejrjrjr In all sorts of automated user feedback situations it seems obvious that you ought to normalize the feedback of each person. A thumbs up from someone who always does that is much less meaningful than from someone who gives them rarely. But I never see companies do that?

25 0

5/1/2025
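The per-person normalization suggested above can be sketched in a few lines. This is one possible reading, assuming votes are encoded as +1 (thumbs up) and -1 (thumbs down) and that each vote is rescaled by that user's own mean and standard deviation (a per-user z-score). The function name `normalize_feedback` and the event format are hypothetical, and nothing here reflects how any company actually processes feedback.

```python
from collections import defaultdict
from statistics import mean, pstdev


def normalize_feedback(events):
    """Rescale each vote relative to the voting habits of the user who cast it.

    events: list of (user_id, item_id, vote) tuples, vote in {+1, -1}.
    Returns a list of (user_id, item_id, score) in the same order.
    """
    votes_by_user = defaultdict(list)
    for user_id, _item_id, vote in events:
        votes_by_user[user_id].append(vote)

    # Per-user mean and population standard deviation of their votes.
    stats = {u: (mean(v), pstdev(v)) for u, v in votes_by_user.items()}

    scored = []
    for user_id, item_id, vote in events:
        mu, sigma = stats[user_id]
        # A user who always votes the same way has sigma == 0:
        # their votes carry no information, so score them as 0.
        score = 0.0 if sigma == 0 else (vote - mu) / sigma
        scored.append((user_id, item_id, score))
    return scored


events = [
    ("always_up", "m1", +1), ("always_up", "m2", +1), ("always_up", "m3", +1),
    ("rarely_up", "m1", -1), ("rarely_up", "m2", -1),
    ("rarely_up", "m3", -1), ("rarely_up", "m4", +1),
]
scores = normalize_feedback(events)
for user_id, item_id, score in scores:
    print(user_id, item_id, round(score, 3))
```

With this scheme a thumbs up from the user who rarely gives one scores well above zero, while every vote from the user who always clicks thumbs up scores zero. A production system would also need a policy for users with very few votes, which this sketch ignores.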