Despite rapid advances in text-to-video generation, state-of-the-art generative models still suffer from producing temporally incoherent and unrealistic motion for videos. The key weakness of existing works is that they commonly treat videos as frame sequences and directly adopt Flow Matching objectives, which are originally designed for images. This practice fails to explicitly model motion priors or temporal dependencies, resulting in suboptimal dynamics that may appear incoherent and unrealistic. To solve this problem, we propose Temporal-aware Flow Matching (TFM), a novel training paradigm that embeds inter-frame constraints into the flow objective, leading to temporally coherent motion modeling in video generation. More specifically, the proposed TFM enforces temporal correlations across frames while retaining the desirable properties of Flow Matching, and further introduces a residual-type loss that aligns naturally with this new flow. We theoretically prove that models trained with TFM are able to exhibit remarkably enhanced temporal perception ability and better capture motion dynamics. Notably, TFM imposes no additional cost during inference and is applicable to any model using Flow Matching. Extensive experiments demonstrate that our TFM can significantly improve motion realism across diverse motion types.
| CogVideoX1.5-5B | HunyuanVideo-13B | Wan2.1-T2V-14B | Temporal Flow Matching (Ours) | |||
A chef flips a cast-iron skillet, sautéed mushrooms sailing and tumbling in glossy butter before settling back.
| CogVideoX1.5-5B | HunyuanVideo-13B | Wan2.1-T2V-14B | Temporal Flow Matching (Ours) | |||
A sprinter explodes from orange-marked blocks, pumping athletic arms at sunrise across a dewy stadium.
| CogVideoX1.5-5B | HunyuanVideo-13B | Wan2.1-T2V-14B | Temporal Flow Matching (Ours) | |||
A renowned sushi chef slices a roll with a single, precise motion.
| CogVideoX1.5-5B | HunyuanVideo-13B | Wan2.1-T2V-14B | Temporal Flow Matching (Ours) | |||
A figure skater leaps into the air, his skates gliding across the ice.
| A close-up of a runner's legs as they dash through a rainstorm, their shoes splashing through puddles as they push forward with determination. | A gymnast balancing on a balance beam, body rotating in a tight, precise arc. | A golden retriever shakes itself dry after jumping into a backyard pool, droplets spraying out like bright, shimmering stars. | ||
| A young man performing a cartwheel on a gray surface. He is dressed in orange pants, a black t-shirt. | A skateboarder launches off a ramp and executes a complex mid-air trick before landing. | A robot arm delicately moves colored chess pieces in a grandmasters’ match. | ||
| A dog leaps into a pile of autumn leaves on a park path, scattering golden and red foliage high into the air. | A man is jumping rope on the sandy beach. | A soccer player skillfully juggles a ball with their feet. | ||