Reinforcement Learning

When AI Training Meets “Food Delivery”: Why Legacy Systems Struggle? Now envision an AI-powered “delivery dispatcher” that must excel in two areas: Delivering orders quickly (general capability) Simultaneously monitoring riders for violations like speeding or running red lights (safety/trustworthiness) Yet traditional training frameworks face multiple limitations: Compartmentalized workflow: Diverse demands (training, inference generation, validation scoring) must be split across separate “stations” handled by different “riders” (clusters/GPUs). Rigid architecture: Adding new constraints (e.g., safety/knowledge/value validators) often requires major pipeline overhauls or complete rebuilds. Poor scaling: More “riders” (GPUs) paradoxically worsen resource imbalance: some idle while others are overloaded to the point of overheating. To solve these challenges, Shanghai AI Lab’s Center for Safe&Trustworthy AI introduces SafeWork-T1: a multimodal, trustworthiness-focused training platform. This intelligent system processes tasks in the colocate mechanism like a “collapsible, modular, multi-purpose workbench”—resolving all above pain points at once to enable safer, more efficient, and more accurate trustworthiness-enhanced training paradigms. ...

Reinforcement Learning

SafeWork-T1: A Safety Reasoning Training Accelerator for Multimodal Large Models

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45° Law