SafeWork-T1: A Safety Reasoning Training Accelerator for Multimodal Large Models

When AI Training Meets “Food Delivery”: Why Legacy Systems Struggle? Now envision an AI-powered “delivery dispatcher” that must excel in two areas: Delivering orders quickly (general capability) Simultaneously monitoring riders for violations like speeding or running red lights (safety/trustworthiness) Yet traditional training frameworks face multiple limitations: Compartmentalized workflow: Diverse demands (training, inference generation, validation scoring) must be split across separate “stations” handled by different “riders” (clusters/GPUs). Rigid architecture: Adding new constraints (e.g., safety/knowledge/value validators) often requires major pipeline overhauls or complete rebuilds. Poor scaling: More “riders” (GPUs) paradoxically worsen resource imbalance: some idle while others are overloaded to the point of overheating. To solve these challenges, Shanghai AI Lab’s Center for Safe&Trustworthy AI introduces SafeWork-T1: a multimodal, trustworthiness-focused training platform. This intelligent system processes tasks in the colocate mechanism like a “collapsible, modular, multi-purpose workbench”—resolving all above pain points at once to enable safer, more efficient, and more accurate trustworthiness-enhanced training paradigms. ...

July 21, 2025 · 3 min · Center for Safe&Trustworthy AI

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45° Law

<!DOCTYPE html> 阅读论文样式修改 Read the Paper 1 Introduction Recent advances in large language models (LLMs) have led to significant improvements in their intelligence, particularly in their reasoning and decision-making capabilities [1, 2]. However, these performance gains are often accompanied by an increasing gap between the capability and safety, moving further away from the AI-45° Law [3]. For example, existing LLMs frequently demonstrate difficulty in upholding ethical principles, societal norms, and wider human values, especially when navigating the complexities of real-world scenarios. ...

July 12, 2025 · 7 min · Center for Safe&Trustworthy AI