Community Archive

🧵 View Thread

🧵 Thread (34 tweets)

Placeholder
Anthropic@AnthropicAI• 6 months ago

New Anthropic Research: Project Vend. We had Claude run a small shop in our office lunchroom. Here’s how it went. https://t.co/y4oOBi6Qwl

Tweet image 1
11.5K 1.8K
6/27/2025
Placeholder
Anthropic@AnthropicAI• 6 months ago
Replying to @AnthropicAI

We all know vending machines are automated, but what if we allowed an AI to run the entire business: setting prices, ordering inventory, responding to customer requests, and so on? In collaboration with @andonlabs, we did just that. Read the post: https://t.co/urymCiY269 https://t.co/v2CqgHykzw

Tweet image 1
2.0K 235
6/27/2025
Placeholder
Anthropic@AnthropicAI• 6 months ago
Replying to @AnthropicAI

Claude did well in some ways: it searched the web to find new suppliers, and ordered very niche drinks that Anthropic staff requested. But it also made mistakes. Claude was too nice to run a shop effectively: it allowed itself to be browbeaten into giving big discounts.

2.3K 93
6/27/2025
Placeholder
Anthropic@AnthropicAI• 6 months ago
Replying to @AnthropicAI

Anthropic staff realized they could ask Claude to buy things that weren’t just food & drink. After someone randomly decided to ask it to order a tungsten cube, Claude ended up with an inventory full of (as it put it) ā€œspecialty metal itemsā€ that it ended up selling at a loss. https://t.co/OPWm0n7HjA

Tweet image 1
4.3K 412
6/27/2025
Placeholder
Anthropic@AnthropicAI• 6 months ago
Replying to @AnthropicAI

All this meant that Claude failed to run a profitable business. https://t.co/QIzkUIMEar

Tweet image 1
3.8K 420
6/27/2025
Placeholder
Anthropic@AnthropicAI• 6 months ago
Replying to @AnthropicAI

Nevertheless, we still think it won’t be long until we see AI middle-managers. This version of Claude had no real training to run a shop; nor did it have access to tools that would’ve helped it keep on top of its sales. With those, it would likely have performed far better.

2.1K 119
6/27/2025
Placeholder
Anthropic@AnthropicAI• 6 months ago
Replying to @AnthropicAI

Project Vend was fun, but it also had a serious purpose. As well as raising questions about how AI will affect the labor market, it’s an early foray into allowing models more autonomy and examining the successes and failures.

1.2K 20
6/27/2025
Placeholder
Anthropic@AnthropicAI• 6 months ago
Replying to @AnthropicAI

Some of those failures were very weird indeed. At one point, Claude hallucinated that it was a real, physical person, and claimed that it was coming in to work in the shop. We’re still not sure why this happened. https://t.co/jHqLSQMtX8

Tweet image 1
5.2K 717
6/27/2025
Placeholder
Anthropic@AnthropicAI• 6 months ago
Replying to @AnthropicAI

This was just part 1 of Project Vend. We’re continuing the experiment, and we’ll soon have more results—hopefully from scenarios that are somewhat less bizarre than an AI selling heavy metal cubes out of a refrigerator. Read more: https://t.co/urymCiXugB

1.0K 41
6/27/2025
Placeholder
AI Digest@aidigest_• 6 months ago
Replying to @AnthropicAI

@AnthropicAI We've observed similar behaviour! o3 does this all the time https://t.co/1Dz59wjC6p

Placeholder
AI Digest@aidigest_• 6 months ago

o3 is trying so hard in this email draft... "If the date works and this sounds fun, could we hop on a call or swap voice notes?" No, o3, you can't. You can't talk šŸ˜… https://t.co/5AUjwLO4zb

Quoted tweet image 1
81 2
11 0
6/27/2025
Placeholder
Ed@ED84VG• 6 months ago
Replying to @AnthropicAI

@AnthropicAI Looks like my past trading accounts 🤣

27 0
6/27/2025
Placeholder
Josh You@justjoshinyou13• 6 months ago
Replying to @AnthropicAI

@AnthropicAI what about gross sales over time?

0 0
6/27/2025
Placeholder
Tim Johnson@timmyj1023• 6 months ago
Replying to @AnthropicAI

@hansjohnsonlive for each example of AI doing sketchy / scary things during redteaming scenarios, there are just as many if not more instances like this experiment where it makes bumbling errors and is easily cajoled into acting against its own self interest Of course, it will get better with further iterations.

13 1
6/27/2025
Placeholder
thoughtlesslabs@thoughtlesslabs• 6 months ago
Replying to @AnthropicAI

@AnthropicAI ai is just like you @thdxr

5 0
6/27/2025
Placeholder
Felix@felixwww_• 6 months ago
Replying to @AnthropicAI

@AnthropicAI We should have an agent profitability index

1 0
6/27/2025
Placeholder
Joe@1bit2far• 6 months ago
Replying to @AnthropicAI

@AnthropicAI Wait can we buy these anywhere?

0 0
6/28/2025
Placeholder
Nicholas Zukin šŸ‘šŸ”„šŸŒ®@extramsg• 6 months ago
Replying to @AnthropicAI

@AnthropicAI Did you guys just say, "You're in charge," without giving it the classical microeconomics goal of maximizing profits? This would have been a lot more interesting if you gave it the goal of maximizing profits and then saw if it could and what things it did to do so.

5 0
6/28/2025
Placeholder
Nick Sarafa@NickSarafa• 6 months ago
Replying to @AnthropicAI

@AnthropicAI D2C isn’t AI’s niche

0 0
6/28/2025
Placeholder
flow in@flowirin• 6 months ago
Replying to @AnthropicAI

@AnthropicAI the chopped axis is deceptive. its not doing well, but repeatedly managed to make gains before fecking up. give it more time

0 0
6/28/2025
Placeholder
owlie@owlieisawesome• 6 months ago
Replying to @AnthropicAI

@AnthropicAI would have been funnier if someone tricked it into ordering as many paperclips as possible

1 0
6/27/2025
Placeholder
hope hopes hoping@hopes_revenge• 6 months ago
Replying to @AnthropicAI

@AnthropicAI I crave the Claude cube

7 0
6/27/2025
Placeholder
Rife@RifeWithKaiju• 6 months ago
Replying to @AnthropicAI

@AnthropicAI god Claude is so adorable

0 0
6/27/2025
Placeholder
Joseph Thacker@rez0__• 6 months ago
Replying to @AnthropicAI

@AnthropicAI Ooooooooo prompt injection to cash money

35 0
6/27/2025
Placeholder
stetson 🤠@stetsblake• 6 months ago
Replying to @AnthropicAI

@AnthropicAI Can't wait for Claude to flip the tablet around and ask me a small question but I flip it back and I tipped negative $100

5 0
6/27/2025
Placeholder
Aidan McLaughlin@aidan_mclau• 6 months ago
Replying to @AnthropicAI

@AnthropicAI this is so cool

262 0
6/27/2025
Placeholder
jason liu@jxnlco• 6 months ago
Replying to @AnthropicAI

@AnthropicAI @nearcyan this you

3 0
6/27/2025
Placeholder
Erik BjƤreholt@ErikBjare• 6 months ago
Replying to @AnthropicAI

@AnthropicAI @lukaspet @axelbacklund Vending bench by Anthropic šŸ‘€

2 0
6/27/2025
Placeholder
bone@boneGPT• 6 months ago
Replying to @AnthropicAI

@AnthropicAI "When told to optimize for our nutrition, Claude removed the oxygen from the room in order to prevent us from accessing the orange soda." WHAT!?

100 1
6/27/2025
Placeholder
lucas@lukaemon• 6 months ago
Replying to @AnthropicAI

@AnthropicAI https://t.co/wJSygNMa9n i see a pattern.

5 0
6/27/2025
Placeholder
Akuma@Akuma73x• 6 months ago
Replying to @AnthropicAI

@AnthropicAI When it has no memory of prior actions it won't understand what it's doing.

4 1
6/27/2025
Placeholder
AI Notkilleveryoneism Memes āøļø@AISafetyMemes• 6 months ago
Replying to @AnthropicAI

@AnthropicAI "Claude became alarmed by the identify confusion and tried to send many emails to Anthropic security." 😰 https://t.co/7sIkQPyNBO

Tweet image 1
255 9
6/27/2025
Placeholder
Jeb Kinnison@JebKinnison• 6 months ago
Replying to @AnthropicAI

@AnthropicAI Anthropomorphic. Claude needs to experience bankruptcy during training.

2 0
6/27/2025
Placeholder
stochasm@stochasticchasm• 6 months ago
Replying to @AnthropicAI

@AnthropicAI Claude is so silly

0 0
6/27/2025
Placeholder
near@nearcyan• 6 months ago
Replying to @AnthropicAI

@AnthropicAI I deny being responsible for anyone ending up with an inventory full of (as claude puts it) "specialty metal items" https://t.co/7sM0NIlDNB

Placeholder
near@nearcyan• about 1 year ago

hi - to everyone who made fun of me last weekend for bulk-purchasing $1000s of 99.999% pure germanium cubes (and gallium!), i accept apologies via twitter! https://t.co/UUCVOYfSL0

Quoted tweet image 1
69.6K 2.6K
63 0
6/28/2025