AI/Deep Dive

Python and JavaScript Are Winning the AI Race

AI models trained on public code are quietly turning Python and JavaScript into the default languages of software, and enterprise stacks risk being left behind.

Olympia Tech··4 min read

AI is not just changing how we code. It is starting to dictate what we code with. As large language models become the default pair-programmer for millions of developers, the languages those models know best are quietly becoming the languages everyone reaches for. And the models know Python and JavaScript best of all.

This is not a future scenario. It is already happening, and the mechanism behind it is simpler than the hype suggests: AI learns what we teach it, and we have taught it mostly Python and JavaScript.

The Data Decides the Winner

LLMs are trained predominantly on public code. They scrape open repositories, parse questions and answers, and absorb the patterns of whatever languages appear most often in that public corpus. Python and JavaScript dominate GitHub and Stack Overflow, so they dominate the training data too.

The result is disproportionate fluency. Models become frighteningly good at generating code in these languages, often faster and more accurately than human developers. Studies cited in the original analysis put numbers on the bias: LLMs lean toward Python in roughly 90 to 97 percent of language-agnostic tasks, and Python is still chosen in about 58 percent of project-initialization tasks even when it is not the optimal tool for the job.

When the language is left up to the model, the model almost always picks Python. That default is no longer a preference. It is becoming gravity.

That default is not a neutral observation. It shapes which ecosystems get more sample code, more tutorials, and more tooling, which in turn feeds the next generation of training data.

The Flywheel Nobody Voted For

What makes this dynamic hard to reverse is its self-reinforcing nature. The cycle looks like this:

  • Models trained on abundant public code get better at Python and JavaScript.
  • Better AI assistance makes those languages more productive to write.
  • More developers choose them, generating still more public code.
  • The next model trains on that larger corpus and gets even better.

Each turn of the wheel widens the gap. Languages that start ahead pull further ahead, not because they are technically superior, but because they are better represented in the data that teaches the tools we now depend on.

The Enterprise Disadvantage

The flip side is a real risk for languages whose code lives behind closed doors. Java, C#, and frameworks like SwiftUI suffer from proprietary codebases and comparatively limited public examples. The AI cannot learn what it cannot see.

In practice, this means AI-generated code in these languages is less reliable, more error-prone, and slower to evolve. The model has simply seen fewer correct patterns to imitate. For enterprises that have standardized on these stacks, the productivity gains their competitors enjoy from AI assistance arrive late, if at all.

The Real Competitive Threat

It is tempting to frame the danger as no-code tools replacing developers. That framing misses the point. The genuine threat is more specific and more uncomfortable: developers who embrace AI tools and work in well-represented, AI-native languages can ship roughly three times faster than peers stuck in less-represented ones.

The divide is not human versus machine. It is AI-aligned developers versus everyone else.

What Enterprises Can Actually Do

The situation is not hopeless, but it requires deliberate action rather than passive hope that the models will catch up on their own. The original piece lays out a practical playbook for organizations tied to enterprise languages:

  • Increase open-source contributions in enterprise languages so models have more public examples to learn from.
  • Develop private LLMs trained on internal codebases, turning proprietary code from a liability into a training asset.
  • Build language-specific AI development tools tuned to the stack you actually use.
  • Invest in comprehensive technical documentation that captures correct patterns explicitly.
  • Educate communities about AI readiness so the ecosystem improves its own visibility.

The through-line is visibility. If a language wants strong AI support, its patterns need to exist somewhere the models can learn them, whether that is the public internet or a private corpus you build yourself.

What to Watch

The key question for the next few years is whether the flywheel keeps accelerating or whether deliberate intervention can slow it. Watch for enterprises standing up private models trained on their own code, because that is the most direct counter to the public-data advantage. Watch whether communities around Java, C#, and Swift mobilize to expand their open-source footprint. And watch the model defaults themselves: if the Python bias in language-agnostic tasks softens, it will be a sign that the broader ecosystem is closing the gap.

For now, the lesson is blunt and worth repeating. AI learns what we teach it. The languages winning the AI race are winning because we gave the models more of them to study, and any language that wants to compete has to decide, consciously, to be taught.

More from AI

Deep DiveAI

Closing the Gap: How Open-Weight Models Caught the Frontier

Three years ago the best downloadable model trailed the frontier by more than a year. In 2026 that lag is measured in months, and a wave of open-weight labs, most of them Chinese, is compressing the distance faster than almost anyone predicted.

Olympia Tech · 10 min
Deep DiveAI

AI Agents: Unlocking the Next UX Revolution

The next leap in user experience is not prettier interfaces but delegation, where autonomous agents act on our intent so we touch less and offload more.

Olympia Tech · 4 min