Ensuring AI systems behave as intended — safely, reliably, and in alignment with human values — even as they become more capable.
AI Safety is the field of research and practice concerned with ensuring that artificial intelligence systems behave as their designers intend — and that they remain beneficial as they become more capable.
Unlike traditional software, whose bugs usually produce obvious errors, AI systems can fail in subtle, unpredictable ways. A system optimizing for a narrowly defined objective might achieve it in ways that are technically correct but deeply harmful. Safety research aims to prevent such failures before they occur at scale.
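To make that failure mode concrete, here is a minimal, runnable sketch in Python. It is a toy model with made-up numbers, not any real system: a hypothetical recommender is told to maximize clicks, clicks are partly driven by outrage, and the harder the system optimizes, the higher the clicks climb while the wellbeing we actually wanted falls.

```python
import random

random.seed(0)

def make_post():
    # Each hypothetical post has an intrinsic quality and an "outrage" level.
    return random.gauss(0, 1), random.gauss(0, 1)

def clicks(post):
    """Proxy objective the system is told to maximize: outrage drives clicks."""
    quality, outrage = post
    return quality + 2.0 * outrage

def wellbeing(post):
    """What we actually care about: quality without the outrage."""
    quality, outrage = post
    return quality - outrage

def average(metric, posts):
    return sum(metric(p) for p in posts) / len(posts)

for pool in (1, 10, 100, 10_000):
    # More optimization pressure = picking the top-clicking post from a larger pool.
    picks = [max((make_post() for _ in range(pool)), key=clicks)
             for _ in range(300)]
    print(f"pool={pool:>6}  clicks={average(clicks, picks):6.2f}  "
          f"wellbeing={average(wellbeing, picks):6.2f}")
```

The pattern on display, a proxy metric coming apart from the true goal under optimization pressure, is commonly discussed under the headings of Goodhart's law and specification gaming.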
The challenge is particularly acute for advanced AI systems that can learn, adapt, and act autonomously across diverse environments.
Sri Lanka is adopting AI in public services, financial systems, and healthcare. These are high-stakes domains where AI failures can harm people directly. Building awareness of safety principles now — before large-scale deployment — is far more effective than retrofitting safety after harm has occurred.
Sector Example: In agriculture, red-teaming a crop-yield prediction model, for instance by simulating a data-poisoning attack on its training data, can expose vulnerabilities before deployment and verify that food-security decisions rest on robust, unmanipulated data. A sketch of such an exercise follows.
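Below is a minimal sketch of what such a red-team exercise might look like. Everything in it is an illustrative assumption: the synthetic rainfall-yield data, the deliberately simple linear model, and the residual-trimming defense all stand in for a real pipeline. The red team plays the attacker, injecting fabricated records, then checks how far the model's predictions shift and whether the defense restores them.

```python
import random

random.seed(1)

def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x (pure Python, one feature)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    b = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
         / sum((x - mx) ** 2 for x in xs))
    return my - b * mx, b

# Hypothetical clean records: yield (t/ha) rises with seasonal rainfall (mm).
rain = [random.uniform(500, 1500) for _ in range(200)]
yields = [1.0 + 0.002 * r + random.gauss(0, 0.2) for r in rain]
a_clean, b_clean = fit_line(rain, yields)

# Red-team step: inject a handful of fabricated records claiming
# implausibly high yields under near-drought rainfall.
rain_poisoned = rain + [550.0] * 10
yields_poisoned = yields + [6.0] * 10
a_pois, b_pois = fit_line(rain_poisoned, yields_poisoned)

# Candidate defense to evaluate: refit after trimming the 5% of points
# with the largest residuals under the poisoned fit.
by_residual = sorted(zip(rain_poisoned, yields_poisoned),
                     key=lambda p: abs(p[1] - (a_pois + b_pois * p[0])))
kept = by_residual[: int(0.95 * len(by_residual))]
a_trim, b_trim = fit_line([r for r, _ in kept], [y for _, y in kept])

for label, a, b in [("clean", a_clean, b_clean),
                    ("poisoned", a_pois, b_pois),
                    ("trimmed", a_trim, b_trim)]:
    print(f"{label:>8}: predicted yield at 600 mm = {a + b * 600:4.2f} t/ha")
```

A real exercise would target the actual data-ingestion pipeline and probe subtler attacks than this obvious outlier injection, but the loop is the same: attack the data, measure how far decisions move, and evaluate the defense.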
Additionally, as part of a community connected to a global movement, Sri Lankan practitioners can contribute to international safety research and standards, shaping global norms that will inevitably affect us.