Path-Finding is a specialized model extraction attack that targets tree-based machine learning models, such as decision trees and random forests, exploiting confidence values and using the rich information provided by APIs on…
Author: Brian Colwell
An Introduction To AI Side-Channel Attacks
Side-Channel Attacks exploit unintended information leakage through observable physical or logical system behaviors such as memory usage, timing information, power consumption, or electromagnetic emissions. Rather than directly querying the model, these attacks…
An Introduction To Defenses For AI Side-Channel Attacks
A side-channel attack is a security exploit that targets information gained from the implementation of a system, rather than attacking the system’s functionality directly. These attacks extract sensitive information by observing the…
Recommendations To Anthropic On Claude’s Constitutional Principles
The primary sources Anthropic used in designing Claude’s Constitution include the Universal Declaration of Human Rights (UDHR), Apple’s Terms Of Service, DeepMind’s Sparrow Rules, and Anthropic research sets 1 & 2. It…
What Are The Principles Upon Which The Constitution Of Anthropic’s Claude Is Built?
Below, the reader will find Claude’s complete set of principles from ‘Claude’s Constitution’ dated May 9, 2023. Before we get into the principles, however, Anthropic wants to “emphasize that our current constitution…
Constitutional AI Aligns Anthropic’s Claude With Human Values
The dawn of artificial general intelligence (AGI) brings with it a complex landscape of profound ethical risks, from bias and regulatory uncertainties to the looming threat of manipulation and AI weaponization. At…
Sentient’s Loyal AI Resolves AGI Ethical Risks?
The primary ethical risks associated with AGI include bias & discrimination, existential risks, governance & regulation, manipulation & loss of human autonomy, and weaponization. It is only through cooperation, decentralized systems, and…
What Are The Ethical Risks Of Strong AI?
The potential for bias, discrimination, and the existential threats posed by misalignment between AGI and human values are deeply concerning. The purpose of this blog post is to highlight clearly and concisely…
Is AGI An Asymmetric Threat?
A strong AI with autonomy, but without the alignment, control mechanisms, and ethical frameworks we see in well-ordered societies, could, in monitoring, evaluating, and ultimately judging the human species as both organizationally…
What Will AI Think? Cogito, Ergo Sum
It was philosopher René Descartes who famously said “Cogito, ergo sum,” or “I think, therefore I am.” For context, Descartes was referring to definite existence: “It is not possible for us to doubt that…