Text-to-Video AI Engineer Salary.
Across 81 U.S. cities.
$175,000
national median salary
$137,000 to $228,000 (25th to 75th percentile). Last updated April 2026.
Highest Paying
$249,000
San Jose, CA
Best Purchasing Power
$182,000
New York, NY
Lowest Paying
$134,000
Jackson, MS
Salary data is sourced from SEC filings, H-1B Labor Condition Applications (DOL), Bureau of Labor Statistics Occupational Employment and Wage Statistics, and aggregated job postings across 50+ platforms. Ranges reflect the 25th to 75th percentile for full-time positions. Cost-of-living adjustments use Bureau of Economic Analysis Regional Price Parities (2025 index). Baseline derived from BLS SOC 15-2051.
The median Text-to-Video AI Engineer salary in the United States is $175,000 in 2026, with the middle half of salaries running from $137,000 at the 25th percentile to $228,000 at the 75th. San Jose pays the most at $249,000, while New York offers the best purchasing power after cost-of-living adjustments.
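As a rough illustration of how a cost-of-living adjustment based on Regional Price Parities can work, the sketch below divides a nominal salary by a price index where 100 represents the national average. The function name and the sample salary and index values are hypothetical and are not taken from this dataset, and the exact adjustment used for the figures on this page may differ.

```python
def adjust_for_cost_of_living(nominal_salary: float, rpp: float) -> float:
    """Express a nominal salary in national-price-level dollars.

    rpp is a Regional Price Parity style index where 100.0 means
    prices equal to the national average (assumed convention).
    """
    if rpp <= 0:
        raise ValueError("rpp must be positive")
    return nominal_salary / (rpp / 100.0)


# Hypothetical inputs, chosen only to show the arithmetic.
example_nominal = 150_000
example_rpp = 90.0  # prices 10% below the national average

example_adjusted = adjust_for_cost_of_living(example_nominal, example_rpp)
print(f"Adjusted salary: ${example_adjusted:,.0f}")
```

An index below 100 raises the adjusted figure and an index above 100 lowers it, which is how a city with a lower nominal salary can rank higher on purchasing power.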
Text-to-Video AI Engineer salary by city
Skills that increase Text-to-Video AI Engineer pay
The skills below are associated with measurable salary premiums for Text-to-Video AI Engineers, based on job posting data. The top skill corresponds to a premium of about $24,500 per year.
≈ +$24,500 per year
≈ +$22,750 per year
≈ +$21,000 per year
≈ +$19,250 per year
≈ +$17,500 per year
≈ +$17,500 per year
≈ +$17,500 per year
≈ +$15,750 per year
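To put the premiums above in context, the sketch below expresses each one as a percentage of the $175,000 national median. Using the national median as the base is an assumption made for illustration; the page does not state which base the premiums were computed from.

```python
NATIONAL_MEDIAN = 175_000  # national median salary reported on this page

# Annual premiums listed above, in the order shown.
PREMIUMS = [24_500, 22_750, 21_000, 19_250, 17_500, 17_500, 17_500, 15_750]


def premium_as_percent(premium: float, base: float = NATIONAL_MEDIAN) -> float:
    """Return a dollar premium as a percentage of the base salary."""
    return premium / base * 100


percents = [round(premium_as_percent(p), 2) for p in PREMIUMS]
for premium, pct in zip(PREMIUMS, percents):
    print(f"+${premium:,} per year is about {pct:.0f}% of the median")
```

Under that assumption the premiums range from 9% to 14% of the median.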
Total compensation breakdown
Salary by company size
Remote salary adjustment
Remote Text-to-Video AI Engineers typically earn $163,000 (7% less than on-site). This reflects location-adjusted pay policies at companies using geographic salary bands. Some companies pay flat national rates regardless of location.
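The remote figure can be checked with a short calculation. The sketch below assumes the on-site baseline is the $175,000 national median, which the page does not state explicitly, and applies the 7% reduction before rounding to the nearest thousand.

```python
def remote_salary(on_site_salary: float, reduction_pct: float) -> float:
    """Apply a percentage reduction to an on-site salary."""
    return on_site_salary * (1 - reduction_pct / 100)


ON_SITE_BASELINE = 175_000  # assumed: the national median from this page
REMOTE_REDUCTION_PCT = 7    # reduction reported on this page

estimate = remote_salary(ON_SITE_BASELINE, REMOTE_REDUCTION_PCT)
rounded_estimate = round(estimate, -3)  # nearest thousand dollars
print(f"Estimated remote salary: ${rounded_estimate:,.0f}")
```

Under that assumption the result is consistent with the $163,000 reported above.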