Community Archive

🧵 View Thread

🧵 Thread (27 tweets)

Placeholder
UltimApe@ultimape• about 3 years ago

A useful exploration on the topic of Goodhart's law"The dynamics highlighted are hopefully useful to explain many situations of interest in policy design, in machine learning, and in specific questions about AI alignment."https://t.co/Ka60eX7ra6 https://t.co/cymqBtsDxm

10 2
7/16/2022
Placeholder
UltimApe@ultimape• about 3 years ago
Replying to @ultimape

Goodhart's law in a nutshell."Asimov's three laws"https://t.co/7AxWjbmMIL

5 0
7/16/2022
Placeholder
UltimApe@ultimape• about 3 years ago
Replying to @ultimape

Good overview of the space of ideas here: https://t.co/Cxs6H9wfKshttps://t.co/DnabOdZCur

3 0
7/16/2022
Placeholder
UltimApe@ultimape• about 3 years ago
Replying to @ultimape

The not-so-obvious extension from economic theory to Schelling points, extended to biology. <3"It can be easier and more reliable to just learn and copy the proxy than to evolve the hidden quality and hope other people catch on" https://t.co/TQ1BSkHG9xhttps://t.co/l6hudT4wiE

6 0
7/16/2022
Placeholder
UltimApe@ultimape• about 3 years ago
Replying to @ultimape

"multi-agent failure modes are more complex, more problematic, and less well understood than the single-agent case, and are also already occurring, largely unnoticed. [...] why these failure modes are in some senses unavoidable." https://t.co/PsOXtSDW21 https://t.co/N3tyQqcrDc

5 0
7/16/2022
Placeholder
UltimApe@ultimape• about 3 years ago
Replying to @ultimape

Why I think it's funny to just not read books.https://t.co/VR8bJ4316u

4 0
7/16/2022
Placeholder
UltimApe@ultimape• about 3 years ago
Replying to @ultimape

Why should I read books when your library's book reading metric system spy's on what you are reading and falls for Goodhart's law in new and interesting ways!https://t.co/qgLqLyMqci

Placeholder
UltimApe@ultimape• over 3 years ago

Your research repository platforms and library book checkout tools are watching what you read.It's even encouraged as best practice https://t.co/QdEueduXuHPay no attention to massive data tracking conglomerate buying up services behind the curtain.https://t.co/oq7DjAukmA

3 0
2 0
7/16/2022
Placeholder
UltimApe@ultimape• about 3 years ago
Replying to @ultimape

"citizens dutifully complied and began breeding venomous snakes to kill and bring to the British. By the time the experiment was over, the snake problem was worse than when it began." https://t.co/iXb10YXD9Uhttps://t.co/eU6jUS0fSf

3 0
7/16/2022
Placeholder
UltimApe@ultimape• about 3 years ago
Replying to @ultimape

"it’s better to take a dead snake as an indicator of what point you are at in the cycle—i.e. you’re at the end of something in your life." https://t.co/GV9YmW8x9Z

1 0
7/16/2022
Placeholder
UltimApe@ultimape• about 3 years ago
Replying to @ultimape

"This is the superset of Goodhart-Campbell. The solution is the superset of costly signaling. The greater the divergence in the incentive structure between proxy and proxied, the greater the incentives to optimize for appearance." https://t.co/tsFuOEdVe8 https://t.co/ZsXeZnJSkQ

3 1
7/16/2022
Placeholder
UltimApe@ultimape• about 3 years ago
Replying to @ultimape

Sure is Noisy in here.https://t.co/uNQ2RL97Rq

Placeholder
UltimApe@ultimape• over 3 years ago

Sure is Noisy in here.https://t.co/Jy19pYiOa2

0 0
0 0
7/17/2022
Placeholder
UltimApe@ultimape• about 3 years ago
Replying to @ultimape

I hear carts clashing and cars honking. I feel the sun gazing down at me in the dead of winter as I lose the ability to feel my hand.https://t.co/a103p3xqwN

Placeholder
UltimApe@ultimape• over 3 years ago

It was also the year I learned about Goodhart’s Law.Why was I reading Goodhart's law as a cart-pusher?https://t.co/HEg41j8uOT

2 0
0 0
7/24/2022
Placeholder
UltimApe@ultimape• about 3 years ago
Replying to @ultimape

"Don't act so innocent, this was no accidentYou planned this in the end and now it's overSay what you wanna say, you lied and I got playedYou threw it all away and now it's over" 🎶https://t.co/XeYh7vMGTEhttps://t.co/ITmUnvhTXE

Placeholder
UltimApe@ultimape• about 3 years ago

A fool and his master are soon bitten by a snake. As the parable of the tortoise and the scorpion...it is in their nature.https://t.co/lODvv4Mlw5

0 0
0 0
8/22/2022
Placeholder
UltimApe@ultimape• almost 3 years ago
Replying to @ultimape

Thinking a lot about the principle-agent problem in the context of actor-network theory right now.How does one gain more agency over their own life?https://t.co/gZ5OTovUdc

Placeholder
Visakan Veerasamy@visakanv• almost 3 years ago

my contribution to NPC discourse is that time spent discussing NPCs is time not spent increasing your own agency and the agency of your peers who also want to have more agency. Focus on what you want to see more of https://t.co/JSIFLLKyEs

71 2
2 0
9/28/2022
Placeholder
UltimApe@ultimape• almost 3 years ago
Replying to @ultimape

I did not.https://t.co/TJ64BFY2U6

0 0
9/28/2022
Placeholder
UltimApe@ultimape• almost 3 years ago
Replying to @ultimape

Hey @lukestein please have your bot not harass my tweets. kthnx. https://t.co/2SfAExNFUw

Tweet image 1
2 0
9/28/2022
Placeholder
UltimApe@ultimape• almost 3 years ago
Replying to @ultimape

@lukestein Your bot haunting my misspelling literally hits a part of me being suicidal. So having random tweets show up hidden is kinda shitty. have you considered the principle agent problem of writing a spelling correction bot and it's impact on behavior? https://t.co/TYb0UDXE9P

Placeholder
UltimApe@ultimape• over 3 years ago

@principle_agent Did you known that a large proportion of children who kill themselves have spelling and penmanship errors in their suicided notes, and are largely driven to this via shame from others policing them over their failures?

1 0
1 0
9/28/2022
Placeholder
UltimApe@ultimape• almost 3 years ago
Replying to @ultimape

@lukestein This is disgustinghttps://t.co/5eqCTBAATS

1 0
9/28/2022
Placeholder
UltimApe@ultimape• almost 3 years ago
Replying to @ultimape

@lukestein I can't report your bot's tweets because it blocks me. I consider it a form of harassment and evasion. Please add my account as an exception to not be policed over some asinine spelling SHibələTH.

1 0
9/28/2022
Placeholder
UltimApe@ultimape• almost 3 years ago
Replying to @ultimape

Neat."We suggest that the analysis of incomplete contracting developed by law and economics researchers can provide a useful framework for understanding the AI alignment problem and help to generate a systematic approach to finding solutions"https://t.co/jHzuJESoCx

1 0
9/28/2022
Placeholder
UltimApe@ultimape• almost 3 years ago
Replying to @ultimape

Oooh, they got some good publications https://t.co/RqETNzqF4X

1 0
9/28/2022
Placeholder
UltimApe@ultimape• almost 3 years ago
Replying to @ultimape

"from social sciences, e.g., Actor-Network Theory (ANT) can support the development of tracing networks of association amongst many actors, which can help understand how, [...] humans may offload value-laden behavior onto the technology around us." https://t.co/g0RFxbuvMn

1 0
9/28/2022
Placeholder
UltimApe@ultimape• almost 3 years ago
Replying to @ultimape

Tired: AI alignment problemWired: AI going mad.https://t.co/EBh9efgz6x

Placeholder
UltimApe@ultimape• almost 11 years ago

"The inside of a computer is dumb as hell but it goes like mad!"~Richard Feynman The Feynman Lectures on Computation http://t.co/0d1XzYNOtf

7 2
2 0
9/28/2022
Placeholder
UltimApe@ultimape• almost 3 years ago
Replying to @ultimape

Inspired: Memetic-Cultural alignment problem.Perspired: Do not feed the egregore after midnight.Transpired: The care and feeding of your egregore.

1 0
9/28/2022
Placeholder
UltimApe@ultimape• almost 3 years ago
Replying to @ultimape

I've got a lovely bunch of paperclips...https://t.co/GWKoTbA4qj

Placeholder
UltimApe@ultimape• almost 3 years ago

huh."They’re converging on the idea that it’s time to stop using gross domestic product (GDP) as the world’s main measure of prosperity, and to complement it with a dashboard of indicators, possibly ones linked to the [Sustainable Development Goals]."https://t.co/Jfg24xcXBx

2 0
0 0
11/17/2022
Placeholder
UltimApe@ultimape• almost 3 years ago
Replying to @ultimape

Runaway utility maximization, in my economic proxy for maximizing paper clip extraction services? https://t.co/JG8fGLmPq4

Placeholder
UltimApe@ultimape• over 9 years ago

1/ A robot may not lose profits for an organization or through inaction allow profits to be lost.

55 41
0 0
11/17/2022
Placeholder
UltimApe@ultimape• 12 months ago
Replying to @ultimape

While it's true that the cliche of "the cobra effect" is anecdotal. The wikipedia has examples of this effect happening that are documented. https://t.co/Vd0FU1N1R9Good to be nuanced here.https://t.co/GpZlq4kVhM https://t.co/OuXW8lRqkc

Tweet image 1
1 0
9/27/2024