AI Safety

Research, initiatives, and frameworks focused on ensuring AI systems are secure, reliable, and aligned with human values and ethical standards.

OpenBox AI and Mastra Add Default Runtime Governance to TypeScript Agents

OpenBox AI and Mastra have partnered to integrate runtime governance directly into the Mastra TypeScript agent framework, enabling compliance-ready oversight and audit capabilities by default.

May 05, 2026

CollectivIQ Expands Platform to Address AI Hallucination and Bias

CollectivIQ has announced major upgrades to its AI consensus platform, adding multimodal image generation, integrated payment capture, and expanded retrieval capabilities to enhance accuracy, collaboration, and governance.

May 05, 2026

OPAQUE Acquires Cryptographic AI Technology from Abu Dhabi's TII

OPAQUE has acquired cryptographic AI technologies from Abu Dhabi’s Technology Innovation Institute, expanding its Confidential AI platform with post-quantum protections for secure model training and deployment.

May 04, 2026

ITRAK 365 Introduces ITRAK IQ for AI Integration in EHS Workflows

ITRAK 365 has launched ITRAK IQ, an AI integration framework designed to connect Environment, Health, and Safety (EHS) and operational data with AI assistants within a customer's Microsoft environment. The framework enables secure, governed AI use without sending data to external services.

April 29, 2026

Kibu Raises $10.5 Million to Expand Its Human Verification Network

Kibu, a human network platform focused on verified digital interactions, has raised $10.5 million in seed funding led by Cubit Capital and Construct Capital to scale its trust-focused technology.

April 29, 2026

MythWorx Introduces NeuroWorx Reasoning Engine

MythWorx has launched NeuroWorx, a verifier-first AI reasoning engine that runs on CPUs and consumes about 2% of the power used by large language models. The company is opening an invite-only private preview for selected organizations.

April 29, 2026

SandboxAQ CEO Warns of Converging GPS and AI Cyber Threats at Davos

At the World Economic Forum in Davos, SandboxAQ CEO Jack Hidary warned that GPS interference and AI-enabled cyberattacks are merging into a single threat to critical infrastructure. He highlighted the company's technologies AQNav and AQtive Guard as tools to address these risks.

April 29, 2026

National Safety Council and Wolters Kluwer Enablon Report Finds Rapid AI Adoption in Workplace Safety

A new report by the National Safety Council and Wolters Kluwer Enablon shows that AI use in environmental, health, and safety programs is expanding quickly, though most professionals call for clear guardrails and stronger digital foundations.

April 28, 2026

KBR Wins $200 Million Contract to Modernize U.S. Transportation with AI

KBR has secured a $200 million contract from the U.S. Department of Transportation's Volpe Center to enhance aviation safety and modernize transportation systems using AI and data analytics.

April 28, 2026

NSS Labs Introduces AI Protection Systems Test Framework

NSS Labs has released the AI Protection Systems test methodology, a framework for evaluating the security of enterprise AI deployments across eight key dimensions, including prompt injection resistance and data exfiltration prevention.

April 28, 2026

Snow Crash Labs CEO Warns of U.S. AI Infrastructure and Safety Risks

Matt O'Brien, CEO of Snow Crash Labs, said in a podcast interview that the U.S. is falling behind China in power capacity needed for AI infrastructure and that enterprises are deploying models without adequate quality control, creating major safety and compliance risks.

April 24, 2026

Singapore Proposes First Global Standard for Generative AI Testing

Singapore has proposed ISO/IEC 42119-8, the first international standard for testing generative AI systems. The standard focuses on benchmarking and red teaming and will be discussed at the ISO/IEC JTC 1/SC 42 plenary meeting in Singapore.

April 21, 2026

Kroll Finds 76 Percent of Firms Faced AI Security Incidents

Kroll reports that 76 percent of organizations have experienced security incidents linked to AI applications or models in the past two years, with over a quarter incurring costs exceeding one million dollars.

April 21, 2026

Inflectra Opens Early Access for SureWire AI QA Platform

Inflectra has launched a free Early Access Program for SureWire, a new quality assurance platform built to test non-deterministic AI agents. The platform introduces specialized testing and judge agents to identify vulnerabilities in AI systems before deployment.

April 13, 2026

Google Updates Gemini with Mental Health Support Features and $30 Million Crisis Funding

Google has introduced new mental health safety features in its Gemini AI system and pledged $30 million to support global crisis helplines. The updates include a redesigned help interface, expanded partnerships, and safeguards for younger users.

April 08, 2026

Safetensors Joins PyTorch Foundation to Strengthen AI Model Security

The PyTorch Foundation has added Safetensors, developed by Hugging Face, as its newest project to improve secure AI model distribution and prevent arbitrary code execution risks.

April 08, 2026

Chai AI Deploys 5,000-GPU Cluster for Model Alignment and Safety Research

Chai AI has deployed a 5,000+ GPU cluster to enhance post-training research focused on large-scale model alignment and safety. The company also formalized new global compliance measures, including age gating and regional verification protocols.

March 27, 2026

IntelePeer Earns TX-RAMP Level 2 Certification for AI Communication Solutions

IntelePeer has achieved TX-RAMP Level 2 certification, confirming its compliance with Texas state security standards for cloud-based AI communication systems used in regulated sectors such as healthcare and government.

March 20, 2026

Anthropic Launches The Anthropic Institute to Study AI’s Societal Impact

Anthropic has launched The Anthropic Institute, a new research initiative focused on understanding the societal, economic, and legal challenges posed by powerful AI systems. The institute will combine internal research teams and expand collaborations with experts across disciplines.

March 16, 2026

Resume.org Survey: 21% of U.S. Companies Halt Entry-Level Hiring Due to AI

A new Resume.org survey finds that 21% of U.S. companies have stopped hiring entry-level workers because of AI, with nearly half expecting to do so by 2027.

March 10, 2026
