🔬 ScholarEnv

The first RL environment for AI-assisted peer review and scholarly integrity verification.

OpenEnv v0.4.0 · 4 Tasks · Running · Meta × PyTorch Hackathon

Available Tasks

formatting_compliance EASY

Fix IEEE manuscript formatting violations: abstract length, section order, citation style, author block.

Max steps: 3 · Frontier baseline: 0.80–0.95

internal_consistency MEDIUM

Find internal contradictions: number mismatches, nonexistent references, inconsistent contribution counts.

Max steps: 4 · Frontier baseline: 0.40–0.65

claim_evidence_audit HARD

Find places where claims in the text do not match the values in the tables. This is where RL training has value: frontier LLMs score only 0.20–0.45 without training.

Max steps: 6 · Frontier baseline: 0.20–0.45

citation_verification MEDIUM

Identify ghost (fabricated) citations and misattributed references. A SQLite cache stores verified citations across episodes.

Max steps: 8 · Frontier baseline: 0.35–0.60
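
The four tasks above can be collected into a small lookup table, for example to cap an agent's episode length per task or to order tasks by difficulty. This is only a sketch: the `TaskSpec` class, its field names, and the `hardest_first` helper are hypothetical and not part of ScholarEnv. The values are copied from the task list above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TaskSpec:
    """Hypothetical container; field names are illustrative, not ScholarEnv's API."""
    difficulty: str
    max_steps: int
    baseline: tuple  # frontier baseline score range as (low, high)


# Values copied from the task list; the dict itself is an assumption.
TASKS = {
    "formatting_compliance": TaskSpec("EASY", 3, (0.80, 0.95)),
    "internal_consistency": TaskSpec("MEDIUM", 4, (0.40, 0.65)),
    "claim_evidence_audit": TaskSpec("HARD", 6, (0.20, 0.45)),
    "citation_verification": TaskSpec("MEDIUM", 8, (0.35, 0.60)),
}


def hardest_first():
    """Return task ids ordered by the midpoint of their baseline range, lowest first."""
    return sorted(TASKS, key=lambda task_id: sum(TASKS[task_id].baseline) / 2)
```

Ordering by baseline midpoint is one possible proxy for difficulty. The EASY/MEDIUM/HARD labels above remain the authoritative grouping.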

API Usage

POST /reset   {"task_id": "formatting_compliance"}
POST /step    {"task": "claim_evidence_audit", "action_type": "query_section", "section_name": "results"}
POST /step    {"task": "claim_evidence_audit", "action_type": "submit_findings", "findings": [...]}
GET  /state   Returns current episode state and curriculum summary
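
The calls above could be driven from a short Python client like the sketch below, which uses only the standard library. The base URL `http://localhost:8000`, the helper names, and the assumption that the server accepts and returns JSON are illustrative guesses, not documented behaviour. The payload keys are copied from the examples above, and the shape of each finding is left to the caller because the listing elides it.

```python
import json
import urllib.request

# Assumption: adjust to wherever the ScholarEnv server is actually running.
BASE_URL = "http://localhost:8000"


def reset_payload(task_id):
    """Body for POST /reset, using the key shown in the listing."""
    return {"task_id": task_id}


def query_section_payload(task, section_name):
    """Body for POST /step with a query_section action."""
    return {"task": task, "action_type": "query_section", "section_name": section_name}


def submit_findings_payload(task, findings):
    """Body for POST /step with a submit_findings action; findings format is caller-defined."""
    return {"task": task, "action_type": "submit_findings", "findings": findings}


def build_request(path, payload=None):
    """Build a GET request when payload is None, otherwise a POST with a JSON body."""
    url = BASE_URL + path
    if payload is None:
        return urllib.request.Request(url, method="GET")
    body = json.dumps(payload).encode("utf-8")
    headers = {"Content-Type": "application/json"}
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


def call(path, payload=None, timeout=10.0):
    """Send the request and decode the response, assuming the server replies with JSON."""
    with urllib.request.urlopen(build_request(path, payload), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Under these assumptions, an episode would start with `call("/reset", reset_payload("formatting_compliance"))`, continue with one or more `call("/step", ...)` requests, and inspect progress with `call("/state")`.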

Nensi Pansuriya · Krushna Parmar · Ishita Bhojani
Meta × PyTorch OpenEnv Hackathon · Round 1 · April 2026