Stellaris Dev Diary #385 - AI Benchmarks

Voidian · May 30, 2025

demons and magic said:
I know I'm late to this topic, but I just wanted to add my 2 cents.

Can you balance the AI around the Ensign rank (where the AI and players have no inherent bonuses)? If the AI is stronger there, it should make higher difficulties more challenging too, right?

You're right, in fact, this happened before.

Stellaris AI was a joke for years, until they decided to adjust it. Among the many changes, they made the AI build a research lab on every planet whenever it had a free slot and nothing else to build. As a result, the AI got so challenging that they had to create a new lowest difficulty below the previous easiest one, and in GA games the AI managed to keep up with the player's research until the endgame.

Of course, they never taught the AI intermediate player skills such as properly designing specialized worlds (for example, ringworld segments with nothing but researchers), but it was good enough at higher difficulties.

With 4.0 they undid everything, as the entire system is different and specialization is more important with the limited building slots. I'm afraid just throwing a random lab on every world won't be enough this time.

The AI needs to understand and abuse the new systems, with lots of urban districts and specializations that make sense.

Seridor · May 30, 2025

I, for one, can't even imagine how anyone could write a working AI for this game. It's bloated with so many options by now. The AI could play as a generic civ with a generic strategy, but with so many civics, traits, ethics, origins, etc., colony management is quite complicated, especially if efficiency is considered, and this is just the economic level. Diplomacy and war management are broken as well, fleet movements for example. That's why I think an AI for a game this complicated should play a simplified economic ruleset, which would still emulate a working AI instead of a non-working one.

Voidian · May 30, 2025

Seridor said:
I, for one, can't even imagine how anyone could write a working AI for this game. It's bloated with so many options by now. The AI could play as a generic civ with a generic strategy, but with so many civics, traits, ethics, origins, etc., colony management is quite complicated, especially if efficiency is considered, and this is just the economic level. Diplomacy and war management are broken as well, fleet movements for example. That's why I think an AI for a game this complicated should play a simplified economic ruleset, which would still emulate a working AI instead of a non-working one.

I constantly pick civs based on themes, often with under-optimized traits and origins, based on what I feel like playing as.

Granted, traditions are much tighter: there is no game without Supremacy, and most of the other choices either give military bonuses or empire size reduction, as very little else seems to matter unless I'm specifically trying to create something like an early federation for thematic reasons.

And even though this is how I make my civs, I've not played any games under GA difficulty, no scaling, for years.

I feel like the choices and combos, trying to push that extra 10% bonus efficiency from min-maxing, don't matter as much as people like to say they do. Good priorities, good build orders, good ship templates, good fleet templates, good benchmarks for what you're expected to have at every 10 years matter a lot more than anything else.

Truly "learning to play" Stellaris did not involve reading ascension perks or traditions. It happened when I spent a weekend restarting the game to optimize my opening and pop jobs, to be able to crush GA difficulty neighbors, genocidal or not, on first contact before 2220.

It's quite mechanical and not all that complicated. If the AI can make good decisions, learn a proper opening, and have proper templates for what good, specialized planets, fleets and ships should look like, it would be extremely challenging even on Ensign difficulty.

Peterkleber · May 30, 2025

Other things the AI should focus on, independent of its "playstyle":
- Prioritize resettling pops onto colonies for higher pop growth, especially on harder difficulties.
You can aim at 2500 pops on colonies. Any more is a very marginal gain. Do it gradually, meaning always fill the jobs on the colony and build nonstop on the colonies. Also don't go lower than 3000 pops on the capital.
This will make the AI a lot stronger, as it will only take about 11-12 years to bring pop growth close to optimal, instead of the 50+ years it takes passively.
Maybe even make this optimal play only available for GA?

- AI should build 1 of each resource district on every planet, for resource specialization districts and lower trade reliance.

- Sell excess food and consumer goods.
4 surplus/month is enough for colony ships.

40 energy/month for resettlement.

125 minerals/month for stations and buildings.

Alloys and research are hard to estimate, but around 25 alloys per 10 years?
Research should be something like:
Y10 100
Y20 300
Y30 600
Y40 1500
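
To make those numbers concrete, here is a rough Python sketch of how such targets could be checked against an empire's monthly income. The research points and the energy/mineral floors are the ones from this post; the linear interpolation between years, the starting point of 0 at year 0, and all function and variable names are my own assumptions, not anything the game actually exposes.

```python
from bisect import bisect_right

# Research targets quoted in the post: (game year, monthly research).
RESEARCH_TARGETS = [(10, 100), (20, 300), (30, 600), (40, 1500)]

# Monthly income floors quoted in the post.
INCOME_FLOORS = {"energy": 40, "minerals": 125}


def research_target(year):
    """Interpolate the monthly research target for a given game year.

    Linear interpolation and the (0, 0) starting point are assumptions;
    the post only gives the four data points above.
    """
    points = [(0, 0)] + RESEARCH_TARGETS
    year = max(year, 0)
    if year >= points[-1][0]:
        return float(points[-1][1])  # no extrapolation past year 40
    years = [p[0] for p in points]
    i = bisect_right(years, year)
    (y0, r0), (y1, r1) = points[i - 1], points[i]
    return r0 + (r1 - r0) * (year - y0) / (y1 - y0)


def shortfalls(year, income):
    """Return {resource: missing amount} for every target the empire misses."""
    targets = dict(INCOME_FLOORS, research=research_target(year))
    return {
        name: target - income.get(name, 0)
        for name, target in targets.items()
        if income.get(name, 0) < target
    }


if __name__ == "__main__":
    # Hypothetical empire at year 20.
    print(shortfalls(20, {"energy": 50, "minerals": 100, "research": 250}))
```

An AI could run a check like this every few years and raise the priority of whatever shows up in the result.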

Eddi · May 31, 2025

Voidian said:
good priorities, good build orders, good ship templates, good fleet templates, good benchmarks for what you're expected to have at every 10 years matter a lot more than anything else.

I'm not doing any of this. I shouldn't be good at beating GA difficulty AIs. Yet I'm here outperforming the AI after only 50 years, with them having no chance to catch up.

thestarsseeall · May 31, 2025

Eladrin said:
Balance has not been a huge target for us since there have been more pressing concerns, but we're doing a balance patch next week to cover some of the most critical issues, like telepaths.

I don't play multiplayer, and haven't had the time to play the most recent update in depth, but I find this statement, and the general balance, a little worrying. Of course players will always find ways to min-max, but shouldn't balance be one of those "an ounce of prevention is worth a pound of cure" kinda things? Sometimes, from the outside, it seems like numbers are randomly generated and tossed into the game, which is discouraging, and further clouds other problems in the game, like this thread about AI Benchmarking. As others in the thread have mentioned, a min-maxing veteran on Grand Admiral with all the DLC plays the game very differently than a base game newbie on Ensign.

Some power creep is inevitable, but do you guys have a roadmap or general set of guidelines that the team refers to when creating new things? It would be nice if we could see that and get a general sense of what the team thinks AI behavior and empire development should be like at certain stages in the game, to compare to our own experiences.

For example, are minor DLC options each allowed to be 1.3 times more powerful than a DLC-free game's civics, and major DLCs 1.5 times stronger? Are all Ascension Paths supposed to make pops ~3 times stronger and offer x amount of new options? Are Crisis Empires supposed to be ~5 times stronger in general than non-crisis empires at their peak? What are the expected upper and lower bounds on these things? Standard deviation? If a player finds a synergy between a trait, a civic, and a government, what's the expected outcome?

Asking players to report their findings and experiences and trusting their feedback is good, but it's kinda shooting in the dark, even if they clarify all their settings, given personal playstyles, empire generation, etc. Some developer benchmarks to compare to would be really helpful, so we can know what is intended and what's not, and see what sticks out. If all the 2x buffs and 5x buffs should make 10x strong empires, except one 1000x empire exists in practice, it would be easier to pick out if we knew what the intended limit was. I'm thinking of stuff like the autocannon situation, where AI empires overrated autocannons and put them on all their ships, the initial rollout of the leader rework with 50 governors and 0% empire size, or espionage, where sabotaging a 50 alloy starbase module costs 100 influence. Is 1 alloy really worth 2 influence?

Voidian · May 31, 2025

Eddi said:
I'm not doing any of this. I shouldn't be good at beating GA difficulty AIs. Yet I'm here outperforming the AI after only 50 years, with them having no chance to catch up.

Exactly, the AI is broken and does not play the game, at all, since 4.0.

Previous GA difficulty AIs from regular empires could beat GA crisis over 5x, solo, and awakened empires the instant they spawned if endgame was set to after 2400. Even my own small vassals have destroyed the endgame crisis, by accident, before I could get to it. I had to set the crisis to 10x or higher just to have a chance of fighting it.

Current AIs on GA can't even handle x1 crisis or an awakened empire.

3.14 Stellaris AI finally got decent after a custodian team patch, so much so that they had to add a new lowest difficulty under the lowest to give players even more bonuses, because some people couldn't even play against it with bonuses. And it never even learned how to use the ship or fleet designers and just sent random crap into every war. The only thing they really changed was telling the AI to build research labs in empty planet slots whenever it had free room for them. Imagine if they had properly taught the AI how to manage resources, use the fleet/ship designers and build proper specialized worlds based off player layouts. Perhaps we wouldn't need GA bonuses to have a fun game.

Too bad the "just build a few labs bro" approach will not work on the 4.0 version of planetary management.

HawkSeraph · Jun 1, 2025

I am very happy that this is being actively worked on, as I have felt that Paradox game AIs are what is holding the SP game experience back the most. Stellaris has a very unfortunate history in that regard due to the various reworks often coming with a total AI dumpster dive.

Using MP players' telemetry should be helpful for seeing where human players are at what timeframe.

In general I think AI should have a strong baseline, with personalities defining what they are *exceptional* at. Materialists should be beating me at science, Aggressive Empires should have a comparable fleet size to me (if not as technologically advanced, maybe). This might be a bit harder for e.g. diplomatically-focussed empires.

As for the questions laid out, I don't really plan for any specific amount to hit - I tend to adapt to what empire I am playing, what I am given, and what my goals are.

malakhglitch · Jun 4, 2025

I want my functional planetary automation back...

WittleWolfie · Jun 6, 2025

I admit, it's hard for me to think about it in the terms requested (i.e. What production value for X by year Y?).

The way I think about the "ideal" AI is this logic:

1. At the start of the game (and possibly when significant things like Ethics shifts occur), the AI should set long-term goals. They should have a specific Ascension Path in mind as well as a more generic goal. Do they want to have a federation? What type? Do they want to be the Galactic Emperor? Do they want to create a galactic paradise? Do they want to control as much of the galaxy as possible or just a small chunk of it?

2. Long term goals should lead into medium term planning. Medium term planning sets the next specific objective. These could be things like Form Federation, Form the Galactic Community, Meet Empires, Colonize Planets, Find Pre-FTLs, Find Archaeological Sites, etc. These should tie into the long term goals and serve as a guide for short term planning.

3. Short term planning. This is where the micromanagement comes in. Short term planning is about taking actions designed to optimize for the medium term goals. If they want to meet empires or colonize planets, they should focus on exploration. If they want to form a federation and they've met empires, they should boost research, engage in diplomacy and trade, and try to unlock Form Federation tech.
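
The three layers above could be sketched roughly like this in Python. This is only an illustration of the idea, not how the game's AI is actually structured: the goal names, the two lookup tables and the `Empire` class are all hypothetical, loosely based on the examples in the list.

```python
from dataclasses import dataclass, field

# Hypothetical: each long-term goal maps to ordered medium-term objectives.
MEDIUM_TERM = {
    "lead_federation": ["meet_empires", "form_federation"],
    "expand_territory": ["meet_empires", "colonize_planets"],
}

# Hypothetical: each medium-term objective maps to short-term actions.
SHORT_TERM = {
    "meet_empires": ["build_science_ship", "survey_systems"],
    "colonize_planets": ["survey_systems", "build_colony_ship"],
    "form_federation": [
        "boost_research",
        "improve_relations",
        "research_federation_tech",
    ],
}


@dataclass
class Empire:
    long_term_goal: str
    completed: set = field(default_factory=set)


def next_objective(empire):
    """Medium-term planning: first objective of the long-term goal not yet done."""
    for objective in MEDIUM_TERM[empire.long_term_goal]:
        if objective not in empire.completed:
            return objective
    return None  # long-term goal reached, time to pick a new one


def short_term_actions(empire):
    """Short-term planning: actions that serve the current objective only."""
    objective = next_objective(empire)
    return SHORT_TERM.get(objective, []) if objective else []


if __name__ == "__main__":
    empire = Empire("lead_federation")
    print(next_objective(empire), short_term_actions(empire))
    empire.completed.add("meet_empires")
    print(next_objective(empire), short_term_actions(empire))
```

The point of the layering is that the short-term layer never has to reason about the long-term goal directly; it only serves whatever the current objective is.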

Unless the AI personality is to dominate economically, everything except Naval Cap and Research is really just a means to an end. Meeting specific production targets for other resources isn't useful unless it is benefiting one of those.

I imagine this is where it gets really tricky, because if you tell the AI not to let mineral income go negative, then it will make short term decisions that slow it down long term unless it understands that the income can be subsidized through the market.
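
As a rough illustration of that market point, the rule could be "a mineral deficit is fine as long as the energy surplus can pay for it" rather than "never go negative". The sketch below is Python; the function name, the price of 1.5 energy per mineral and the reserve parameter are all made up for the example, since the real market price changes during a game.

```python
def mineral_deficit_is_affordable(
    mineral_income,
    energy_income,
    energy_per_mineral=1.5,  # hypothetical market price, not a game value
    energy_reserve=0.0,      # monthly energy surplus to keep after buying
):
    """Return True if buying the missing minerals keeps energy above the reserve."""
    if mineral_income >= 0:
        return True  # no deficit, nothing to subsidize
    monthly_cost = -mineral_income * energy_per_mineral
    return energy_income - monthly_cost >= energy_reserve


if __name__ == "__main__":
    print(mineral_deficit_is_affordable(-20, 40))  # cost is 30 energy/month
    print(mineral_deficit_is_affordable(-20, 25))
```

With a check like this, the AI would only fall back to "fix minerals first" when the deficit can't be covered, instead of every time the number turns red.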

In any case, I would expect an AI with this high level approach to perform quite well, as long as its ability to understand the impact of its decisions is good enough. e.g. If the AI doesn't realize that boosting research means building another tech planet, which in turn means boosting CG / mineral output, then it wouldn't work very well. I suspect the AI already has this capability though.

If the AI can understand broadly what its goals are, what the interim steps are, and what actions can bring it closer to the next step, I would expect it to perform quite well. The only other thing I'd do is probably just give it something close to a hard coded understanding of how planetary optimization works and probably weight it against making drastic changes so it doesn't try to respec worlds or build too many mixed use worlds.

P.S. I'd love to see an AI "Personality" that's all about defending the galaxy against crises. If there is one, it doesn't feel like it.

Shirohane · Jun 7, 2025

Would it be possible to provide a way to apply AI mods to each empire individually during the pre-game empire setup screen?

This would allow modders to compete against each other to see which AI strategy is the best.
Paradox could also use these competitions as a reference when improving the official standard AI.

passeris · Jun 7, 2025

Shirohane said:
Would it be possible to provide a way to apply AI mods to each empire individually during the pre-game empire setup screen?

This would allow modders to compete against each other to see which AI strategy is the best.
Paradox could also use these competitions as a reference when improving the official standard AI.

Excellent justification for an extravagant feature request.

Soranya · Jun 16, 2025

WittleWolfie said:
In any case, I would expect an AI with this high level approach to perform quite well, as long as its ability to understand the impact of its decisions is good enough. e.g. If the AI doesn't realize that boosting research means building another tech planet, which in turn means boosting CG / mineral output, then it wouldn't work very well. I suspect the AI already has this capability though.

Yeah - probably not really tbh.
