Ensuring a good future with advanced AI systems

Our research agenda focuses on building Cognitive Emulation, an AI architecture that bounds systems' capabilities and makes them reason in ways that humans can understand and control.

Cognitive Emulation Articles

Oct 19, 2023

An Introduction to Cognitive Emulation

All human labor, from writing an email to designing a spaceship, is built on cognition. Somewhere along the way, human intuition is used in a cognitive algorithm to form a meaningful idea or perform a meaningful task. Over time, these ideas and actions accrue in communication, formulating plans, researching opportunities, and building solutions. Society is entirely composed of these patterns, and Cognitive Emulation is built to learn and emulate them.

Alignment Articles

Oct 21, 2023

Alignment

Today, one paradigm dominates the AI industry: build superintelligence by scaling up black-box, monolithic neural network models as fast as possible. Alarmingly, there is consensus in the AI safety community that there are no known techniques to make superintelligent systems safe. Without properly “aligning” these systems, deploying them could lead to devastating consequences.

Oct 12, 2023

unRLHF - Efficiently undoing LLM safeguards

Produced as part of the SERI ML Alignment Theory Scholars Program - Summer 2023 Cohort, under the mentorship of Jeffrey Ladish. I'm grateful to Palisade Research for their support throughout this project.

Nov 26, 2022

The First Filter

Consistently optimizing for solving alignment (or any other difficult problem) is incredibly hard. The first and most obvious obstacle is that you need to actually care about alignment and feel responsible for solving it. You cannot just ignore it or pass the buck; you need to aim for it.

Jul 29, 2022

Abstracting The Hardness of Alignment: Unbounded Atomic Optimization

If there's one thing alignment researchers excel at, it's disagreeing with each other. I dislike the term pre-paradigmatic, but even I must admit that it captures one obvious feature of the alignment field: the constant debates about the what, the how, and the value of different attempts. Recently, we even had a whole sequence of debates, and since I first wrote this post Nate shared his take on why he can’t see any current work in the field actually tackling the problem. More generally, the culture of disagreement, debate, and criticism is obvious to anyone reading the AF.

Jul 20, 2022

How to Diversify Conceptual Alignment: the Model Behind Refine

We need far more conceptual AI alignment research approaches than we have now if we want to increase our chances of solving the alignment problem. However, the conceptual alignment field remains hard to access, and what feedback and mentorship there is focuses on a few existing research directions rather than stimulating new ideas.

Jun 6, 2022

Epistemological Vigilance for Alignment

Nothing hampers Science and Engineering like unchecked assumptions. As a concrete example of a field ridden with hidden premises, let's look at sociology. Sociologists must deal with the feedback of their object of study (people in social situations), their own social background, as well as the myriad folk sociology notions floating in the memesphere.

Apr 9, 2022

Productive Mistakes, Not Perfect Answers

I wouldn’t bet on any current alignment proposal. Yet I think that the field is making progress and abounds with interesting opportunities to do even more, giving us a shot. Isn’t there a contradiction? No, because research progress so rarely looks like having a clearly correct insight that clarifies everything; instead it often looks like building on apparently unpromising ideas, or studying the structure of the problem.
