model safety

EU’s AI Transparency Code: What Tech Giants Will Dislike image

July 10, 20253.7k0

EU’s AI Transparency Code: What Tech Giants Will Dislike

The European Commission has published a voluntary code of practice under the landmark EU ...

Misalignment on a Budget: Finetuning and Steering Vectors image

June 10, 20252.2k0

Misalignment on a Budget: Finetuning and Steering Vectors

Published on June 8, 2025 3:28 PM GMT TL;DR We reproduce emergent misalignment (Betley ...

AI Moratorium Fails: Amodei’s Push for Transparency Standards image

June 5, 20257.6k0

AI Moratorium Fails: Amodei’s Push for Transparency Standards

By expanding on technical challenges, regulatory comparisons, and expert perspectives, we explore why a ...

© 2026 Web Crafting Code