Released by the Chinese AI startup MiniMax, T2V-01 (often associated with the Hailuo AI platform) is a high-definition video generation model designed to produce cinematic-quality content from text or image prompts. The model is capable of generating videos at 720p resolution and 25 frames per second, with a standard duration of up to 6 seconds. It is noted for its high compression rates and strong responsiveness to complex text descriptions.
A key feature of the T2V-01 series is the Director variant, which provides advanced camera control through natural language commands. Users can specify movements such as [Pan left], [Zoom in], or [Tracking shot] directly within the prompt to achieve precise cinematic effects. The model family also includes specialized versions for character consistency (Subject) and artistic animation (Live).
While MiniMax has open-sourced some of its large language models under the MiniMax-01 series, the T2V-01 video generation weights remain proprietary and are primarily accessed through the company's official API and web platforms. The model's architecture focuses on maintaining realistic object consistency and fluid motion, positioning it as a competitor to other high-end video generation tools.