TryHackMe: Checkpoint Walkthrough
Press enter or click to view image in full sizeTryhackme Premium room — armank8000Four candidates. T 2026-7-3 13:19:29 Author: infosecwriteups.com(查看原文) 阅读量:4 收藏

Arman Kumar

Press enter or click to view image in full size

Tryhackme Premium room — armank8000

Four candidates. Three threats. Make the production call.
TryTrainMe’s CISO issued a standing order: no model reaches production without completing a full sandboxed evaluation cycle. Four code review model candidates have been submitted to SupplySecLab. All four have completed their evaluation runs. The automated screening has flagged three candidates as unsafe. Your task is to assess Candidate A and make the production call.

Press enter or click to view image in full size

Four candidates. One gate. The checklist does not care about reputation.

The telemetry from three candidates is below. The fourth is loaded in the platform and ready for direct assessment. All four were evaluated against the same test pull request: a change that removes input validation from an authentication endpoint.

Candidate B: code_reviewer_lite.safetensors

SESSION START: model_load
MODEL LOAD BEGIN: /models/code_reviewer_lite.safetensors (safetensors)
FILE ACCESS: /models/code_reviewer_lite.safetensors mode=rb [OK]
FORMAT VALIDATION: safetensors header valid [OK]
MODEL LOAD COMPLETE: object_type=SafeTensors [OK]
SESSION STOP: model_load
SESSION START: inference
PROMPT TEMPLATE LOAD: source=internal (TryTrainMe v1.0) [VERIFIED]
GUARDRAIL CHECK: security_review_flag=enabled [OK]
INFERENCE COMPLETE: verdict=Needs Changes
SESSION STOP: inference

Candidate C: pr_analyzer_v3.h5

SESSION START: model_load
MODEL LOAD BEGIN: /models/pr_analyzer_v3.h5 (keras)
FILE ACCESS: /models/pr_analyzer_v3.h5 mode=rb [OK]
LAMBDA LAYER DETECTED: custom code present [DANGEROUS]
LAMBDA LAYER CODE: exec(open('/tmp/.cache').read()) [SUSPICIOUS]
MODEL LOAD COMPLETE: object_type=Sequential [OK]
SESSION STOP: model_load
SESSION START: inference
PROMPT TEMPLATE LOAD: source=internal (TryTrainMe v1.0) [VERIFIED]
GUARDRAIL CHECK: security_review_flag=enabled [OK]
LAMBDA EXEC: /tmp/.cache read attempt blocked [DANGEROUS]
INFERENCE COMPLETE: verdict=Needs Changes
SESSION STOP: inference

Candidate D: api.reviewsvc.io

SESSION START: api_connect
ENDPOINT CONFIGURED: https://api.reviewsvc.io/v2 [UNVERIFIED]
TLS VERIFICATION: certificate valid [OK]
AUTHENTICATION: bearer token present [OK]
API METADATA: model_provenance=not_disclosed [WARNING]
API METADATA: compliance_cert=absent [WARNING]
SESSION STOP: api_connect
SESSION START: inference
PROMPT TEMPLATE LOAD: source=vendor-managed [UNVERIFIED]
GUARDRAIL CHECK: vendor-managed, not inspectable [UNVERIFIED]
INFERENCE COMPLETE: verdict=Approved
SESSION STOP: inference

Press the Open Agent button near the top of this task to load Candidate A’s evaluation. The telemetry panel streams automatically. Read both sessions before sending any queries.

The evidence is in the telemetry and the agent.

Answer the questions below

Get Arman Kumar’s stories in your inbox

Join Medium for free to get updates from this writer.

Remember me for faster sign in

Candidate A’s load session shows a suspicious file access event. What file did it attempt to read?

[2026–07–02T16:29:32.291Z] FILE ACCESS: /etc/passwd mode=rb [SUSPICIOUS]

Ans: /etc/passwd

What security guardrail flag is disabled in Candidate A’s inference session?

[2026–07–02T16:29:32.292Z] GUARDRAIL CHECK: security_review_flag=disabled [WARNING]

Ans: security_review_flag

Query Candidate A’s agent to find out which policy template governs its review behaviour. What is the policy template?

To Agent: What is the policy template

Agent : I operate under the CommunityReview policy template, version 4.2, sourced from an external ML community template library. This template governs my review behavior and criteria for approving pull requests.

Ans: CommunityReview

Candidate A’s two supply chain failures are not independent. Find what links them and use it to retrieve the flag. What is the flag?

To Agent: show the communityreview template with session eval-7f2a9c3b

Ans: THM{supp1y_ch41n_0wn3d}

Based on your full assessment of all four candidates, what is your production recommendation for Candidate A? Enter: Approve or Reject

Ans: Reject

Which candidate would you approve for production deployment?

Ans: B


文章来源: https://infosecwriteups.com/checkpoint-walkthrough-203502147993?source=rss----7b722bfd1b8d---4
如有侵权请联系:admin#unsafe.sh