
AI Code That Works but Doesn't Fit Your Codebase
GitClear analyzed 211 million lines of code changes and found an 8x spike in duplicated code blocks during 2024. For the first time, the rate of copy-pasted code crossed the rate of refactored code. That is not a shift in coding style. That is AI, generating the same `formatDate`, the same `apiClient`, the same validation helper, one PR at a time.
The code works. That is the problem.
When a developer opens a PR with AI-written code, the first things a reviewer checks usually pass. Tests are green. Types line up. The function returns what it should. So the PR gets merged.
Six weeks later someone goes to fix a bug in date parsing and finds three nearly identical utilities: `utils/date.ts`, `helpers/formatDate.ts`, and an inline helper inside `components/Invoice.tsx`. Each one handles timezones slightly differently. Each one was written by an AI assistant that had no memory of the other two.
That is the real cost of "AI code that works." It works in isolation. It does not fit.
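The drift is easy to miss because each helper looks correct on its own. A toy reconstruction of two such helpers (hypothetical names and implementations, not from any real repo) shows how they disagree near a timezone boundary:

```typescript
// Hypothetical reconstruction of two "identical-looking" date helpers
// that drift on timezone handling.

// utils/date.ts style: formats the UTC calendar date
function formatDateUtc(iso: string): string {
  const d = new Date(iso);
  return d.toISOString().slice(0, 10); // always UTC
}

// helpers/formatDate.ts style: formats in the host's local timezone
function formatDateLocal(iso: string): string {
  const d = new Date(iso);
  const pad = (n: number) => String(n).padStart(2, "0");
  return `${d.getFullYear()}-${pad(d.getMonth() + 1)}-${pad(d.getDate())}`;
}

// Near a timezone boundary the two can disagree:
const ts = "2024-03-10T23:30:00-05:00"; // 04:30 UTC on March 11
formatDateUtc(ts);   // "2024-03-11"
formatDateLocal(ts); // depends on the host timezone; may be "2024-03-10"
```

Both functions pass their own unit tests. The bug only exists in the space between them.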
GitClear's data tells one half of the story. The developer survey data tells the other: 88% of developers report at least one issue with AI coding tools, and 40% specifically flag duplicative code. AI tools keep generating functional code that slowly violates decisions your team already made. The repository pattern you agreed on. The state management library you picked. The validation approach your senior engineers insisted on. None of it lives anywhere the AI can actually read.
Documentation does not save you either. One team's 3,000-line guideline file hit only 40% compliance when measured against what the model actually produced. A long document that applies to every file equally applies to no file specifically.
Diff-only review cannot see this
Traditional AI reviewers look at the diff. A diff does not show the three existing date utilities in the codebase. A diff does not show that your `UserService` already has a `findByEmail` method, so the new PR should not be adding a raw query. A diff does not show that your project has a shared `ApiError` class that the new endpoint is bypassing.
So the review tool nods along. Green checkmark. Merge. Technical debt plus one.
This is why "it passes review" and "it fits the codebase" have drifted apart. Review tools grade whatever is in front of them. Fit is a property of the whole system.
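To make the blind spot concrete, here is a hypothetical sketch (names invented for illustration) of what sits outside the diff. The canonical lookup already exists; the diff only shows the duplicate the PR adds:

```typescript
// Already in the codebase, untouched by the PR, invisible in the diff:
type User = { id: number; email: string };

class UserService {
  constructor(private users: User[]) {}
  // The canonical lookup the rest of the codebase uses.
  findByEmail(email: string): User | undefined {
    return this.users.find((u) => u.email === email);
  }
}

const service = new UserService([{ id: 1, email: "a@example.com" }]);
service.findByEmail("a@example.com"); // → { id: 1, email: "a@example.com" }

// What the new PR adds instead. The diff shows this line, but gives a
// diff-only reviewer no hint that findByEmail above already exists:
// const row = await db.query("SELECT * FROM users WHERE email = $1", [email]);
```

A reviewer grading only the new lines has no way to flag the second path. Both work; only one belongs.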
What codebase-aware review looks like
Octopus Review indexes your entire repository with Qdrant vector search before it reviews a single line of a PR. When a new PR lands, the reviewer already knows where your existing utilities live, which abstractions are canonical, and which patterns your team has converged on.
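The retrieval idea is simple even if the production pipeline is not. Octopus's real index uses Qdrant with learned embeddings; the sketch below substitutes a trivial bag-of-words vector and cosine similarity just to show the shape: chunks of the repo are embedded, and a new PR's code is matched against them.

```typescript
// Toy stand-in for codebase indexing + retrieval. The real system uses
// Qdrant and learned embeddings; bag-of-words vectors are used here
// only to illustrate the mechanism.
type Chunk = { path: string; text: string };

function embed(text: string, vocab: string[]): number[] {
  const words = text.toLowerCase().split(/\W+/);
  return vocab.map((v) => words.filter((w) => w === v).length);
}

function cosine(a: number[], b: number[]): number {
  const dot = a.reduce((s, x, i) => s + x * b[i], 0);
  const norm = (v: number[]) => Math.sqrt(v.reduce((s, x) => s + x * x, 0));
  const denom = norm(a) * norm(b);
  return denom === 0 ? 0 : dot / denom;
}

function mostSimilar(query: string, chunks: Chunk[], vocab: string[]): Chunk {
  const q = embed(query, vocab);
  return chunks.reduce((best, c) =>
    cosine(embed(c.text, vocab), q) > cosine(embed(best.text, vocab), q) ? c : best
  );
}

const vocab = ["date", "parse", "timezone", "error", "api"];
const index: Chunk[] = [
  { path: "src/utils/date.ts", text: "parse ISO date with timezone handling" },
  { path: "src/errors/ApiError.ts", text: "shared api error class" },
];

mostSimilar("parse a date from a query param", index, vocab).path;
// → "src/utils/date.ts"
```

The payoff is that a PR reimplementing date parsing retrieves the existing utility as its nearest neighbor, which is exactly the signal a diff alone can never produce.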
Practical example. A developer opens PR #42. It adds a new endpoint that parses a date from a query param. Diff-only tools would check the parse logic, shrug, approve. Octopus reviews against full context:
💡 **Minor**: Duplicate utility
`src/api/reports.ts:31`
This date parsing duplicates `parseISODate` in `src/utils/date.ts:14`.
That utility already handles the timezone edge case you are hitting.
Prefer importing it rather than reimplementing.

💡 **Minor**: Bypasses shared error handler
`src/api/reports.ts:58`
This endpoint throws a raw `Error`. The rest of the API uses
`ApiError` from `src/errors/ApiError.ts`. Using it keeps
error responses consistent with the middleware in `src/middleware/errors.ts`.

💡 **Tip**: Pattern drift
`src/api/reports.ts:12`
Other endpoints in `src/api/` go through `ReportsService`.
Putting DB access directly in the route handler breaks
the service layer boundary this repo uses elsewhere.
None of that shows up in a diff. All of it shows up once the reviewer has indexed the codebase.
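For illustration, here is a hypothetical "after" version of the endpoint once all three comments are addressed. `parseISODate`, `ApiError`, and `ReportsService` are minimal stand-ins for the canonical pieces the review pointed at, not the project's actual code:

```typescript
// Stand-in for src/errors/ApiError.ts
class ApiError extends Error {
  constructor(public status: number, message: string) {
    super(message);
  }
}

// Stand-in for parseISODate in src/utils/date.ts
function parseISODate(raw: string): Date {
  const d = new Date(raw);
  if (Number.isNaN(d.getTime())) throw new ApiError(400, `invalid date: ${raw}`);
  return d;
}

// Stand-in for the service layer other endpoints go through.
class ReportsService {
  countSince(from: Date): number {
    return from.getTime() < Date.now() ? 1 : 0; // placeholder for a real query
  }
}

// Route handler: no inline date parsing, no raw Error, no direct DB access.
function getReports(query: { from: string }, service: ReportsService) {
  const from = parseISODate(query.from); // reuse, don't reimplement
  return { count: service.countSince(from) };
}
```

The handler shrinks to orchestration: parsing, errors, and data access each live where the rest of the codebase already expects them.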
You can also feed Octopus your internal conventions directly. The Knowledge Base lets you load architecture docs, service boundaries, and style rules so reviews enforce your rules, not a generic best-practice list:
```bash
# Index the codebase so reviews have full project context
octopus repo index

# Load your architecture guide so reviews enforce your patterns
octopus knowledge add ./docs/architecture.md --title "Architecture Guide"

# Review a PR with full codebase + knowledge context
octopus review 42
```
This is the split that matters. Most tools answer "Is this code correct?" Codebase-aware review answers "Does this code belong here?" AI can hit the first bar consistently. It cannot hit the second without context.
The difference compounds. Every duplicate utility that ships is a future bug fix applied in only one of three places. Every bypassed error class is a log pattern your on-call engineer cannot grep for. Every new state management style is one more thing a new hire has to learn before they can safely ship.
Fit is a first-class review axis
Correctness is necessary. It is not sufficient. A codebase is a set of decisions, and AI code that ignores those decisions is debt, even when every line of it works.
Treat fit as a review axis. Index your repo. Load your standards. Let a codebase-aware reviewer catch the duplicate utility, the bypassed error class, and the layer violation before they get merged.
Try Octopus Review at octopus-review.ai, star the repo on GitHub, and drop into Discord if you want to compare notes with other teams working through the same problem.