Speech-to-Text AI Engineer Salary.
Across 81 U.S. cities.
$150,000
national median salary
$117,000 to $195,000. Last updated June 2026.
Highest Paying
$207,000
San Francisco, CA
Best Purchasing Power
$157,000
New Orleans, LA
Lowest Paying
$113,000
Charleston, WV
Salary data sourced from SEC filings, H-1B Labor Condition Applications (DOL), Bureau of Labor Statistics Occupational Employment and Wage Statistics, and aggregated job postings across 50+ platforms. Ranges reflect 25th to 75th percentile for full-time positions. Cost-of-living adjustments use Bureau of Economic Analysis Regional Price Parities (2025 index). Last updated June 2026. Baseline derived from BLS SOC 15-2051. Full methodology.
The average Speech-to-Text AI Engineer salary in the United States is $150,000 in 2026, with the full range spanning $117,000 at the 25th percentile to $195,000 at the 75th. San Francisco pays the most at $207,000, while New Orleans offers the best purchasing power after cost-of-living adjustments. Compensation for Speech-to-Text AI Engineers is driven by depth of technical specialization, open-source or published work, and the specific technology stack.
Speech-to-Text AI Engineer salary by city.
Skills that increase Speech-to-Text AI Engineer pay.
The skills below command measurable salary premiums for Speech-to-Text AI Engineers based on job posting data. Learning the top skill here could add $21,000 to your annual compensation.
≈ +$21,000 per year
≈ +$19,500 per year
≈ +$18,000 per year
≈ +$16,500 per year
≈ +$15,000 per year
≈ +$15,000 per year
≈ +$15,000 per year
≈ +$13,500 per year
What you should know.
Compensation for Speech-to-Text AI Engineers is driven by depth of technical specialization, open-source or published work, and the specific technology stack. Equity is a major component at roughly 25% of base — candidates should weight stock grants as heavily as salary when comparing offers. Within tech-sector Speech-to-Text AI Engineers specifically, employer tier (FAANG and frontier-AI labs vs mid-stage startups vs traditional enterprise) drives 67%+ variance across the compensation band.
Speech-to-Text AI Engineers typically progress Junior → Mid → Senior → Staff → Principal over 8 to 12 years, with the Staff+ levels carrying significant technical scope and cross-team influence. The director/VP track diverges around year 8 for those who choose management; IC staff-plus roles keep building technical depth.
Total compensation for Speech-to-Text AI Engineers runs roughly $216K at median when factoring base + equity (25% of base annually) + bonus (15% of base). Equity is the single largest non-base component — candidates should model vesting schedules (typically 4-year with 1-year cliff) and compare grant values across offers carefully. At tech companies specifically, equity and sign-on are often the largest delta between offers — two roles with matching base can differ by $100K+ at total when equity is included.
Total compensation breakdown.
Salary by company size
Remote salary adjustment
Remote Speech-to-Text AI Engineers typically earn $140,000 (7% less than on-site). This reflects location-adjusted pay policies at companies using geographic salary bands. Some companies pay flat national rates regardless of location.
Are you a Speech-to-Text AI Engineer?
Share your real compensation anonymously. Help build the most accurate salary dataset for this role. Your data is never individually exposed.
Related tools