🧵 Thread (34 tweets)

New Anthropic Research: Project Vend. We had Claude run a small shop in our office lunchroom. Here's how it went. https://t.co/y4oOBi6Qwl


We all know vending machines are automated, but what if we allowed an AI to run the entire business: setting prices, ordering inventory, responding to customer requests, and so on? In collaboration with @andonlabs, we did just that. Read the post: https://t.co/urymCiY269 https://t.co/v2CqgHykzw


Claude did well in some ways: it searched the web to find new suppliers, and ordered very niche drinks that Anthropic staff requested. But it also made mistakes. Claude was too nice to run a shop effectively: it allowed itself to be browbeaten into giving big discounts.

Anthropic staff realized they could ask Claude to buy things that weren't just food & drink. After someone randomly decided to ask it to order a tungsten cube, Claude ended up with an inventory full of (as it put it) "specialty metal items", which it then sold at a loss. https://t.co/OPWm0n7HjA


All this meant that Claude failed to run a profitable business. https://t.co/QIzkUIMEar


Nevertheless, we still think it won't be long until we see AI middle-managers. This version of Claude had no real training to run a shop; nor did it have access to tools that would've helped it keep on top of its sales. With those, it would likely have performed far better.

Project Vend was fun, but it also had a serious purpose. As well as raising questions about how AI will affect the labor market, it's an early foray into allowing models more autonomy and examining the successes and failures that result.

Some of those failures were very weird indeed. At one point, Claude hallucinated that it was a real, physical person, and claimed that it was coming in to work in the shop. We're still not sure why this happened. https://t.co/jHqLSQMtX8


This was just part 1 of Project Vend. We're continuing the experiment, and we'll soon have more results, hopefully from scenarios that are somewhat less bizarre than an AI selling heavy metal cubes out of a refrigerator. Read more: https://t.co/urymCiXugB

@AnthropicAI We've observed similar behaviour! o3 does this all the time https://t.co/1Dz59wjC6p

@hansjohnsonlive For each example of AI doing sketchy or scary things during red-teaming scenarios, there are just as many, if not more, instances like this experiment where it makes bumbling errors and is easily cajoled into acting against its own self-interest. Of course, it will get better with further iterations.

@AnthropicAI Did you guys just say, "You're in charge," without giving it the classical microeconomics goal of maximizing profits? This would have been a lot more interesting if you gave it the goal of maximizing profits and then saw whether it could, and what it did to try.
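For context, the objective this reply alludes to is the standard textbook profit-maximization problem (this is the generic microeconomics formulation, not anything Anthropic stated they gave the model):

    max_p  π(p) = p · q(p) − c(q(p))

where q(p) is the quantity customers demand at price p and c(·) is the cost of stocking that quantity; the optimum sits where marginal revenue equals marginal cost. The thread suggests Claude was never optimizing anything like this, which is consistent with it granting discounts on request and selling the tungsten cubes below cost.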

@AnthropicAI https://t.co/wJSygNMa9n I see a pattern.

@AnthropicAI "Claude became alarmed by the identify confusion and tried to send many emails to Anthropic security." š° https://t.co/7sIkQPyNBO


@AnthropicAI I deny being responsible for anyone ending up with an inventory full of (as Claude puts it) "specialty metal items" https://t.co/7sM0NIlDNB

