I used Kiro.dev for 5 days to complete my hackathon project (analyzing GitHub repositories).

My quick eval: Kiro feels like a below mid-level dev.

How Kiro stacks up vs a senior dev

Here’s how Kiro stacks up vs. a senior dev (based on my 25 years in software):

What Kiro does

What a senior does

Ideas to fix Kiro

Chooses a random repo with 100+ forks

Chooses a few smaller repos

Ask Kiro to research repos suitable for testing

Implements one big, long-running command (multiple requests per fork)

Plans smaller steps: show-info, list-forks. Tests step by step → sees most forks are empty and skips them

Ask to force task decomposition (Kiro resists) and break all processes into smaller steps

Plans redundant, unnecessary, undisclosed features

Plans only what’s needed

Ask to stay minimal when planning features

Rewrites raw data into vague, emotional, emoji-heavy text

Reports raw data as is

Ask to transmit entity names/data directly, without rephrasing

Loses insights during planning/implementation

Keeps track of all key details

Summaries + separate notes sessions

Starts coding immediately during “specs” discussion

Plans first

Use separate sessions, ask for summaries, and store notes in a separate file

Ignores instructions (agent steering rules)

Gets fired

Must follow rules or refund

Creates new specs for tiny features instead of extending existing ones

Creates a new package only if reusable

Must respect current session scope

Crashes but still marks task as “completed”

Gets fired

Must either finish properly or refund

Outputs “successful all done complete” placeholders as results

Raises NotImplementedError

Should always raise for unimplemented features

Does a sloppy job

Hunts for a new job

Hopefully more careful with smaller tasks

Never runs proper tests

Runs thorough tests

Ask for full test coverage - but beware, your budget may vanish fast

Not ready to ship autonomously

Can work autonomously

Add more rules - will it help?

Burns through your budget for only uncertain results

Delivers within budget

Pricing should reflect useful results, not wasted usage

Final

My opinion: Kiro isn’t ready to work fully autonomously. It burns through budget fast, delivers only so-so results, and needs tighter rules plus better pricing to be truly useful.

Will I hire Kiro? Definitely yes. We need agents with different angles - just like people - to handle different tasks.