Codility Proctoring & Integrity

Reviewable evidence.
Human accountability_

When every candidate has access to AI tools and screen-capture-invisible cheating apps, a test score alone is not enough. Codility delivers layered integrity signals that give reviewers evidence they can act on.

Multi-layered signals across behaviour, identity, similarity, and visual dimensions

Aggregated Integrity Risk that surfaces the candidates worth reviewing

AI enabled or disabled per assessment, with every interaction captured as reviewable AI activity

The challenge

Assessment integrity is harder
than it used to be

Candidates now have access to AI coding assistants, screen-capture-invisible tools, and solution databases. Whether you enable AI assistance or switch it off, a single integrity check is no longer sufficient. Your process needs layered, reviewable evidence at every stage.

AI tools rewrite candidate behaviour

Candidates can generate, paste, or retype AI-generated solutions. Traditional paste detection alone does not surface these patterns.

Cheating apps evade screen sharing

Apps like InterviewCoder and Cluely request exclusion from screen capture, so they stay invisible to standard proctoring.

Manual review does not scale

Reviewing video for every candidate is costly and slow. Teams need aggregated signals that surface the sessions worth reviewing.

One signal is not enough

No single check covers every scenario. A candidate might pass paste detection but fail identity verification. Only layered evidence gives the full picture.

The shift

From a number to reviewable evidence

Without layered integrity	With Codility Proctoring & Integrity
A result with no supporting evidence	Behavioural, identity, and similarity signals in one reviewable record
Paste detection only, blind to AI retyping patterns	Typing Pattern Detection flags line-by-line retyping from external AI sources
No identity verification before the assessment starts	Government-issued ID verification before the first task
Manual video review for every candidate	Aggregated Integrity Risk surfaces the sessions that need review
No visibility into screen-capture-invisible apps	Cheating apps detection surfaces apps that exclude themselves from screen capture, for human review
No way to verify the candidate understood what they submitted	AI Follow-Up Questions ask candidates to explain their reasoning, on the spot

What’s in the platform

Layered integrity signals across every dimension

Behavioural Proctoring

Automatic detection of suspicious signals: code pasted from external sources, task description copied, time away from the IDE, and abnormally fast completion. In Interview, signals surface live to the interviewer during the session.

Screen Interview Skills Intelligence

Code Evolution Chart

Visual timeline of how the candidate’s code changed over time. Proctoring events are overlaid on the code history, surfacing sudden large insertions and other suspicious patterns.

Screen Interview

Similarity Check

Automated cross-candidate plagiarism detection. Compares submissions against other candidates’ solutions and flags matches for manual review. Positive matches automatically elevate Integrity Risk to High.

Screen Skills Intelligence

Premium

ID Verification

Candidates verify their identity with a government-issued ID before the assessment begins. Results feed into Integrity Risk under the Identity and Network category.

Data handling reference →

Screen Interview

Premium

Visual & Video Proctoring

Periodic webcam and screen snapshots during Screen assessments. Full webcam, screen, and audio recording is also available. Interview includes session recording and full transcript as a premium feature.

Data handling reference →

Screen Interview Skills Intelligence

Network IP Check

Checks whether a candidate used multiple IP addresses during an assessment. A single IP means no risk. Multiple IPs flag medium risk, adding another data point to the Integrity Risk calculation.

Screen

Typing Pattern Detection

Identifies candidates who retype AI-generated solutions line by line from an external source. Analyses typing behaviour after submission, with zero disruption during the assessment. Fully passive, requires no setup, and covers most coding task types.

Screen · Preview

AI Follow-Up Questions

Automatically generated questions that ask candidates to explain the choices and reasoning behind their submitted code. Surfaces candidates who can produce AI-generated output but cannot defend it. Deliberately not scored, so the signal stays human-judged and bias-free.

Screen · Preview Interview · Preview Skills Intelligence · Preview

Cheating apps detection

via the Codility Desktop App

A lightweight desktop app candidates install before the assessment. It is designed to surface apps that request exclusion from screen capture, the technique tools like InterviewCoder and Cluely use to stay invisible to standard proctoring, and hand that evidence to a human reviewer. Detected apps and detection times appear in the candidate report.

Screen · Preview Interview · Preview

Unified intelligence

Layered signals. One reviewable risk level.

Integrity Risk aggregates behavioural, identity, network, and similarity signals into a single risk level: None, Low, Moderate, or High. It appears on every candidate report and in the Test Mission Control view.

This is decision support, not a verdict. Reviewers see which signal categories contributed to the risk level and can drill into the underlying evidence. The final call always stays with a human.

None Low Moderate High

How it works

Configure once. Evidence collects automatically.

Step 1

Configure integrity settings

Choose which proctoring levels to enable when creating the test: behavioural signals, ID verification, visual snapshots, or full video recording. Settings lock once the first candidate is invited.

Step 2

Candidates complete the assessment

Integrity signals are captured automatically during the session. Behavioural events, code changes, snapshots, and video are recorded without disrupting the candidate experience.

Step 3

Review the evidence

The Integrity Risk widget on each candidate report shows the risk level and the underlying signals. Drill into code evolution, video playback, or specific behavioural events.

Product availability

Where each signal is available

Feature	Screen	Interview	Skills Intelligence
Behavioural Proctoring	✓	✓	✓
Code Evolution Chart	✓	✓
Similarity Check	✓		✓
Integrity Risk	✓		✓
ID Verification	Premium	Premium
Visual & Video Proctoring	Premium	Premium	Premium
Network IP Check	✓
Typing Pattern Detection	Preview
AI Follow-Up Questions	Preview	Preview	Preview
Cheating apps detection	Preview	Preview

Interview recording and transcript available as a premium feature. Behavioural proctoring signal coverage may vary by Interview environment type.

What reviewers see

Evidence you can act on after every assessment

Every completed assessment produces structured integrity data. Reviewers see the full picture without watching every recording.

Integrity Risk widget

Aggregated risk level on every candidate report. See which signal categories contributed and drill into the evidence.

Code evolution timeline

Visual history of every code change, overlaid with paste events, focus changes, and other behavioural signals.

Similarity report

Cross-candidate plagiarism status with simplified labels. Positive matches automatically elevate Integrity Risk to High.

ID verification status

Verified, failed, or not attempted. Status feeds directly into the Identity and Network signal category.

Video and snapshot playback

Webcam, screen snapshots, and full session recording viewable alongside the code timeline.

Typing pattern flag

Risk flag when code was entered in a line-by-line pattern that suggests retyping from an external AI source. Passive, post-submission.

AI follow-up responses

Candidate answers to AI-generated reasoning questions, attached to the submission. Reviewers judge depth of understanding directly. Not scored, by design.

Detected apps report

Apps that requested exclusion from screen capture, with detection times, surfaced in the candidate report for human review.

Integrity Risk level Code evolution Typing pattern flag AI follow-up responses Similarity status ID verification Video playback Cheating apps detection API export

Reviewable evidence.Human accountability_

Assessment integrity is harderthan it used to be