
OpenAI45Lab

Welcome 👋 to AI45, a safety ecosystem platform developed by Shanghai Artificial Intelligence Laboratory.

Core Philosophy

The platform is guided by the AI-45° Law. From a long-term perspective, AI safety and performance should ideally advance in parallel along a 45° line. Short-term fluctuations are permissible, but in the long run, this balance should neither fall below 45° (as at present) nor exceed it (to avoid constraining development).

Multiple technical pathways may satisfy the AI-45° Law. We are exploring a causality-centered approach, the "Causal Ladder of Trustworthy AGI", which spans three progressive layers: the Approximate Alignment Layer, the Intervenable Layer, and the Reflectable Layer.

Core Modules

🔬 Safety Foundation

🛡️ Safety Technology

🏆 Safety Evaluation

🌐 Safety Services

Popular repositories

  1. OpenRT (Public)

    Open-source red teaming framework for MLLMs with 37+ attack methods

    Python · 215 stars · 9 forks

  2. AgentDoG (Public)

    A Diagnostic Guardrail Framework for AI Agent Safety and Security

    Python · 174 stars · 9 forks

  3. ActorAttack (Public)

    Python · 121 stars · 10 forks

  4. Awesome-Trustworthy-Embodied-AI (Public)

    JavaScript · 91 stars · 2 forks

  5. REEF (Public)

    Repository for the paper "REEF: Representation Encoding Fingerprints for Large Language Models", which aims to protect the IP of open-source LLMs.

    Python · 74 stars · 8 forks

  6. Flames (Public)

    Flames is a highly adversarial Chinese-language benchmark for evaluating the harmlessness of LLMs, developed by Shanghai AI Lab and Fudan NLP Group.

    63

