Community Archive

🧵 Thread (24 tweets)

George@georgejrjrjr• 7 months ago

> GPT model stopped speaking Croatian
> Nobody could figure out why. Turns out
> Croatian users were much more prone to downvote messages
https://t.co/NHgqUJXAOy

Tweet image 1
Tweet image 2
14.3K 1.1K
4/30/2025
Ashita Orbis@AshitaOrbis• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr I've always wondered if anyone pressed the upvote or downvote buttons on ChatGPT, and if they did anything. This is a surprising answer to both

43 0
4/30/2025
konzuko@konzuko• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr it was after i read the llama3 paper that i learnt just how janky model training is. reading it changed my life

167 2
5/1/2025
George@georgejrjrjr• 7 months ago
Replying to @konzuko

@konzuko Then I suspect you’ll enjoy this.

Nathan Lambert@natolambert• 10 months ago

As the only AI lab that can share 100% of the details of training language models, at @allen_ai we're really kind of obligated to share more on how it works (and what doesn't). Here's a reflection with @mechanicaldirk @kylelostat & @soldni on OLMo 2 and what comes next!

00:00:00 Introduction
00:02:45 Early history of the OLMo project
00:15:27 The journey to stability
00:25:00 The evolving role of OLMo and pretraining research
00:29:00 Pretraining Q&A (µP, scaling laws, MoE, etc.)
00:40:40 How to think about pretraining data work
00:54:30 Role of pre-training vs mid training vs post-training
01:02:19 Release strategy and wrapping up

Links below.

Quoted tweet image 1
417 80
74 4
5/1/2025
Jake@jakub864• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr That doesn't make sense. Wouldn't English be downvoted more than Croatian by Croatian users?

12 0
5/1/2025
George@georgejrjrjr• 7 months ago
Replying to @jakub864

Object level: Croats are presumably the lion’s share of users speaking Croatian with the model. Whereas they are a drop in the bucket for English.

Meta-level: I unironically appreciate your earnest engagement with my shitpost.

…in part because there’s a Sapir-Whorf angle I hadn’t thought to consider: do polyglots’ apparent preferences differ based on the language they’re speaking? I have no idea and I wonder if this has ever been studied.

115 1
5/1/2025
Chris@knowclarified• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr Link to article?

4 0
5/1/2025
George@georgejrjrjr• 7 months ago
Replying to @knowclarified

@knowclarified No can do, sadly. Will deleted it.

George@georgejrjrjr• 7 months ago

@gwern Affirmative. Relatedly, if anyone captured the rest of it I’d like a copy. (DMs open + Signal in bio).

25 1
9 0
5/1/2025
Shubham Arora@shubham_arora_0• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr the Eastern European stereotype holds 😂

5 0
5/1/2025
Pale Blue Mediocrity@MatveyShishov• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr Hilarious! So ChatGPT wasn't artificially overhyped maybe? Just an effect of mostly talking with American users?

110 1
5/1/2025
George@georgejrjrjr• 7 months ago
Replying to @MatveyShishov

@MatveyShishov which tho? america is a mosaic wrt glazing norms.

25 1
5/1/2025
Lazar Stojković ⚔️@LazarStojkovic• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr Croatia is in Southern, not Eastern Europe. https://t.co/YYHnnbgRNd https://t.co/6kZm5X8fnb

Tweet image 1
39 3
5/1/2025
George@georgejrjrjr• 7 months ago
Replying to @LazarStojkovic

@LazarStojkovic depends on who you ask https://t.co/57RXcB4Z9j

Tweet image 1
116 0
5/1/2025
@8lackPrince• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr > GPT gave up speaking croatia https://t.co/rmsvxIlBhJ

Tweet image 1
59 0
5/1/2025
George@georgejrjrjr• 7 months ago
Replying to @georgejrjrjr

šŸ™ to the anon who captured and dm’d the second half of the tweet. https://t.co/T3XGRodgP7

Tweet image 1
326 9
5/1/2025
Loweren@See_Elegance• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr What I don't get is: why does OpenAI have such an issue writing prize-winning prose and personalising their text output to the user, but Midjourney has no problem making prize-winning and personalized images? Why is text harder to optimize than photos?

1 0
5/1/2025
sdmat@sdmat123• 7 months ago
Replying to @georgejrjrjr

I don't see the distinction between things people don't like that they favor in their revealed preferences and optimization failures. Surely the failures he cites for the latter stem from the objective and data rather than the optimization/learning process itself. I have a revealed preference for delicious, unhealthy fast food. If I hire a chef to make healthy meals and they follow my revealed preference rather than expressed preference that would be a problem with their objective and not a failure to optimize.

4 0
5/1/2025
Conor@jconorgrogan• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr total american cultural AI victory, this is how we win https://t.co/OSc6ZQry3M

6 0
5/1/2025
Pierre Bongrand@bongrandp• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr what do you think @Teknium1 ? is post training this tricky/sensitive? did OpenAI mess up by pushing this model to 800M users without enough testing? or a bit of both?

15 0
5/1/2025
George@georgejrjrjr• 7 months ago
Replying to @bongrandp

@bongrandp @Teknium1 sft maximalists don’t have reward model problems

13 0
5/1/2025
king midas 霊@breath_mirror• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr it also started speaking spanish like argentinians (a subset of spanish speakers who use “vos” rather than “tu” for the word “you”, plus different conjugations). no one knows why either @willdepue

5 0
5/1/2025
Bart Stewart@flatfingers0• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr This has been floating around for some years now, in the same spirit: https://t.co/0mdGIwIxMj

Tweet image 1
46 4
5/1/2025
Pascal-Emmanuel Gobry@pegobry_en• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr @FistedFoucault

6 0
5/1/2025
Andrew Clough@aclough• 7 months ago
Replying to @georgejrjrjr

@georgejrjrjr In all sorts of automated user feedback situations it seems obvious that you ought to normalize the feedback of each person. A thumbs up from someone who always does that is much less meaningful than from someone who gives them rarely. But I never see companies do that?

25 0
5/1/2025
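
Editor's sketch of the per-user normalization @aclough describes, in Python. This is one possible reading (a per-user z-score), not something any company in the thread is known to use. The function name, the tuple layout, and the sample user and message IDs are all hypothetical.

```python
from collections import defaultdict
from statistics import mean, pstdev


def normalize_feedback(votes):
    """Rescale each vote by the voter's own habits (per-user z-score).

    `votes` is a list of (user_id, message_id, vote) tuples, where vote is
    +1 for a thumbs up and -1 for a thumbs down. All names are hypothetical.
    """
    by_user = defaultdict(list)
    for user_id, _message_id, vote in votes:
        by_user[user_id].append(vote)

    # Mean and population standard deviation of each user's own votes.
    stats = {u: (mean(v), pstdev(v)) for u, v in by_user.items()}

    normalized = []
    for user_id, message_id, vote in votes:
        mu, sigma = stats[user_id]
        # A user who always votes the same way gives no contrast: score 0.
        score = 0.0 if sigma == 0 else (vote - mu) / sigma
        normalized.append((user_id, message_id, score))
    return normalized


# Made-up sample data: one user upvotes everything, one rarely upvotes.
votes = [
    ("always_up", "m1", 1), ("always_up", "m2", 1), ("always_up", "m3", 1),
    ("mostly_down", "m1", -1), ("mostly_down", "m2", -1),
    ("mostly_down", "m3", -1), ("mostly_down", "m4", 1),
]
scores = normalize_feedback(votes)
```

With this sample, every vote from the user who always upvotes scores zero, while the single upvote from the habitual downvoter carries more weight than any of that user's downvotes. A real system would presumably also need a minimum vote count per user before trusting the statistics.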