I used Kiro.dev for 5 days to complete my hackathon project (analyzing GitHub repositories).
My quick eval: Kiro feels like a below mid-level dev.
- Coding skills: mid-level
- Engineering skills: below mid
- Discipline: below mid
How Kiro stacks up vs a senior dev
Here’s how Kiro stacks up vs. a senior dev (based on my 25 years in software):
What Kiro does |
What a senior does |
Ideas to fix Kiro |
---|---|---|
Chooses a random repo with 100+ forks |
Chooses a few smaller repos |
Ask Kiro to research repos suitable for testing |
Implements one big, long-running command (multiple requests per fork) |
Plans smaller steps: show-info, list-forks. Tests step by step → sees most forks are empty and skips them |
Ask to force task decomposition (Kiro resists) and break all processes into smaller steps |
Plans redundant, unnecessary, undisclosed features |
Plans only what’s needed |
Ask to stay minimal when planning features |
Rewrites raw data into vague, emotional, emoji-heavy text |
Reports raw data as is |
Ask to transmit entity names/data directly, without rephrasing |
Loses insights during planning/implementation |
Keeps track of all key details |
Summaries + separate notes sessions |
Starts coding immediately during “specs” discussion |
Plans first |
Use separate sessions, ask for summaries, and store notes in a separate file |
Ignores instructions (agent steering rules) |
Gets fired |
Must follow rules or refund |
Creates new specs for tiny features instead of extending existing ones |
Creates a new package only if reusable |
Must respect current session scope |
Crashes but still marks task as “completed” |
Gets fired |
Must either finish properly or refund |
Outputs “successful all done complete” placeholders as results |
Raises NotImplementedError |
Should always raise for unimplemented features |
Does a sloppy job |
Hunts for a new job |
Hopefully more careful with smaller tasks |
Never runs proper tests |
Runs thorough tests |
Ask for full test coverage - but beware, your budget may vanish fast |
Not ready to ship autonomously |
Can work autonomously |
Add more rules - will it help? |
Burns through your budget for only uncertain results |
Delivers within budget |
Pricing should reflect useful results, not wasted usage |
Final
My opinion: Kiro isn’t ready to work fully autonomously. It burns through budget fast, delivers only so-so results, and needs tighter rules plus better pricing to be truly useful.
Will I hire Kiro? Definitely yes. We need agents with different angles - just like people - to handle different tasks.