Brian Colwell, Author at Brian D. Colwell

What Are Path-Finding Attacks?

Posted on June 7, 2025June 7, 2025 by Brian Colwell

Path-Finding is a specialized model extraction attack that targets tree-based machine learning models, such as decision trees and random forests, exploiting confidence values and using the rich information provided by APIs on…

An Introduction To AI Side-Channel Attacks

Posted on June 7, 2025June 7, 2025 by Brian Colwell

Side-Channel Attacks exploit unintended information leakage through observable physical or logical system behaviors such as memory usage, timing information, power consumption, or electromagnetic emissions. Rather than directly querying the model, these attacks…

An Introduction To Defenses For AI Side-Channel Attacks

Posted on June 7, 2025June 7, 2025 by Brian Colwell

A side-channel attack is a security exploit that targets information gained from the implementation of a system, rather than attacking the system’s functionality directly. These attacks extract sensitive information by observing the…

Recommendations To Anthropic On Claude’s Constitutional Principles

Posted on June 6, 2025June 6, 2025 by Brian Colwell

The primary sources Anthropic utilized in designing Claude’s Constitution include: The Universal Declaration of Human Rights (UDHR), Apple’s Terms Of Service, DeepMind’s Sparrow Rules, and Anthropic research sets 1 & 2. It…

What Are The Principles Upon Which The Constitution Of Anthropic’s Claude Is Built?

Posted on June 6, 2025June 6, 2025 by Brian Colwell

Below the reader will find Claude’s complete set of principles from ‘Claude’s Constitution’ dated May 9, 2023. Before we get into the principles, however, Anthropic wants to “emphasize that our current constitution…

Constitutional AI Aligns Anthropic’s Claude With Human Values

Posted on June 6, 2025June 6, 2025 by Brian Colwell

The dawn of artificial general intelligence (AGI) brings with it a complex landscape of profound ethical risks, from bias and regulatory uncertainties to the looming threat of manipulation and AI weaponization. At…

Sentient’s Loyal AI Resolves AGI Ethical Risks?

Posted on June 6, 2025June 6, 2025 by Brian Colwell

The primary ethical risks associated with AGI include bias & discrimination, existential risks, governance & regulation, manipulation & loss of human autonomy, and weaponization. It is only through cooperation, decentralized systems, and…

What Are The Ethical Risks Of Strong AI?

Posted on June 6, 2025June 6, 2025 by Brian Colwell

The potential for bias, discrimination, and the existential threats posed by misalignment between AGI and human values are deeply concerning. The purpose of this blog post is to highlight clearly and concisely…

Is AGI An Asymmetric Threat?

Posted on June 6, 2025June 6, 2025 by Brian Colwell

A strong AI with autonomy, but without the alignment, control mechanisms, and ethical frameworks we see in well-ordered societies, could, in monitoring, evaluating, and ultimately judging the human species as both organizationally…

What Will AI Think? Cogito, Ergo Sum

Posted on June 6, 2025June 6, 2025 by Brian Colwell

It was philosopher René Descartes who famously said “Cogito, ergo sum,” or, “I think, therefore I am”. For context, Descartes was referring to definite existence: “It is not possible for us to doubt that…