# Codex User-Scope Loopy-Harness Report

작성일: 2026-05-03  
위치: `/Users/jung-wankim/.codex/harness`  
대상: Codex user-scope harness / memory-bank / loopy-era / claude-code-site 자료 통합 분석

## 0. One-line Verdict

당신이 요구하는 것은 단순한 프롬프트, 스킬, 규칙 묶음이 아니다. 요구 수준은 **Practical L6+ / L7-oriented Personal AI Work OS Harness**다.

핵심은 다음 폐루프다.

```text
사용자 철학/요구 수준
  → runtime 규칙/스킬/훅/검증기
  → 작업 실행
  → 증거 기반 검증
  → 실패/마찰 신호 수집
  → self-improve 패치
  → 재검증/ack/rollback
  → 다음 세션에 반영
```

## 1. 산출물 위치

### User-scope control plane

```text
/Users/jung-wankim/.codex/harness
```

생성된 구조:

```text
~/.codex/harness/
  README.md
  requirements.md
  structure.md
  contracts/
    loopy-era-runtime-contract.md
  scope/
    scope-classifier-matrix.md
  manifests/
    user-scope-target.json
  inventories/
    current-user-scope.json
  plans/
    adoption-plan.md
  reports/
    completion-audit.md
    completion-verification.json
```

### Spec artifact mirror

```text
/Users/jung-wankim/.omx/specs/codex-user-scope-loopy-harness
```

## 2. 근거 자료

이 보고서는 아래 자료를 기반으로 한다.

| 구분 | 경로 |
|---|---|
| Autoresearch 종합 분석 | `/Users/jung-wankim/.omx/specs/autoresearch-harness-loopy-era-analysis/report.md` |
| Autoresearch 승인 결과 | `/Users/jung-wankim/.omx/specs/autoresearch-harness-loopy-era-analysis/result.json` |
| Start-harness 분석 | `/Users/jung-wankim/Project/Claude/claude-code-site/start-harness-analysis.md` |
| Codex harness 문서 | `/Users/jung-wankim/Project/Claude/claude-code-site/codex-harness-system.html` |
| Claude Code harness 문서 | `/Users/jung-wankim/Project/Claude/claude-code-site/claude-code-harness-system.html` |
| Loopy-era architecture | `/Users/jung-wankim/Project/Claude/claude-code-site/loopy-era-architecture.html` |
| Memory Bank 분석 | `/Users/jung-wankim/Project/Claude/claude-code-site/memory-bank-analysis.html` |
| Enterprise harness complexity | `/Users/jung-wankim/Project/Claude/claude-code-site/docs/enterprise-harness-complexity-analysis.md` |
| Runtime alignment state | `/Users/jung-wankim/Project/Claude/claude-code-site/docs/engineering-discipline/harness/loopy-era-runtime-alignment/state.md` |
| Memory Bank README | `/Users/jung-wankim/Project/Claude/memory-bank/README-KR.md` |

## 3. 요구 수준: L6+ / L7 지향

| 레벨 | 의미 | 현재 요구와의 관계 |
|---|---|---|
| L0 | 좋은 답변/프롬프트 | 부족. 세션 종료 시 사라진다. |
| L1 | 스킬/명령 모음 | 부족. 실행은 쉬우나 학습 루프가 약하다. |
| L2 | 훅/스크립트 일부 자동화 | 부분 충족. 실패가 학습되지 않으면 부족하다. |
| L3 | Evidence gate가 있는 작업 루프 | 최소 기준. 완료 주장을 검증해야 한다. |
| L4 | Memory Bank 기반 반복 실수 방지 | 필수. 과거 대화와 결정이 runtime으로 돌아와야 한다. |
| L5 | HARD gate + soft-to-hard 승격 | 필수. 중요한 원칙은 권고가 아니라 차단/검증이어야 한다. |
| L6 | Self-improve closed loop | 목표 최소선. signal→patch→verify→ack/rollback이 닫혀야 한다. |
| L7 | User-scope Work OS | 지향점. user/project/plugin/MCP/trend까지 운영체계화한다. |

## 4. 핵심 요구사항 13개

| ID | 요구사항 | 적용 표면 |
|---|---|---|
| `REQ-AUTO-001` | 기본값은 자율 실행 | `AGENTS.md`, skills, agents/prompts |
| `REQ-EVID-001` | 완료는 증거로만 선언 | hooks, scripts, reports, harness/reports |
| `REQ-LOOP-001` | loopy-era 폐루프 완성 | scripts, state, reports, self-improve, trend-harvest |
| `REQ-HARD-001` | 중요한 규칙은 HARD gate로 승격 | rules/hard, hooks, scripts |
| `REQ-MEM-001` | Memory Bank를 runtime substrate로 사용 | hooks/memory-bank, MCP, memories |
| `REQ-STATE-001` | fresh context + persistent state 병행 | state, reports, MCP |
| `REQ-SCOPE-001` | user/project/plugin/MCP scope 분류 | harness/scope, scripts/project-scope |
| `REQ-ROLL-001` | keep/discard/rollback 규율 | trend-harvest, reports, manifests |
| `REQ-DRIFT-001` | runtime/source drift 관리 | config.toml, hooks.json, manifest inventory |
| `REQ-TREND-001` | 외부 trend를 개선 루프 입력으로 사용 | trend-harvest, loopy-era-trend-harvester |
| `REQ-HUMAN-001` | 사용자는 실행자가 아니라 방향 설정자 | AGENTS.md, workflow skills |
| `REQ-PRIV-001` | 개인 철학과 공개/기업용 구조 분리 | harness/contracts, docs, plugins |
| `REQ-VERIFY-001` | 완료 전 prompt-to-artifact audit 필수 | harness/reports/completion-audit.md |

## 5. 현재 Codex user-scope inventory

`~/.codex/harness/inventories/current-user-scope.json` 기준이다.

| 표면 | 수량 |
|---|---:|
| skills | 2522 files |
| agents | 29 files |
| prompts | 41 files |
| rules | 157 files |
| hooks | 134 files |
| scripts | 122 files |
| plugins | 211 files |
| memories | 3 files |
| reports | 14 files |
| state | 23 files |
| trend_harvest | 73 files |
| MCP servers | 5 configured |
| enabled plugins | 4 configured |

MCP 서버:

```text
omx_code_intel
omx_memory
omx_state
omx_trace
omx_wiki
```

Enabled plugins:

```text
browser-use@openai-bundled
documents@openai-primary-runtime
presentations@openai-primary-runtime
spreadsheets@openai-primary-runtime
```

## 6. Codex user-scope 구조 해석

기존 `~/.codex`는 이미 실행 표면을 충분히 갖고 있다. 따라서 완성 전략은 대규모 이동이 아니라 `harness/`를 control plane으로 추가해서 기존 표면을 정렬하는 것이다.

```text
~/.codex/
  AGENTS.md              # 전역 행동 계약
  config.toml            # model/MCP/plugin binding
  hooks.json             # hook activation map
  skills/                # workflow capsules
  agents/                # native agent TOML
  prompts/               # role prompt surfaces
  rules/                 # soft/checkable/HARD rule source
  hooks/                 # event-time guards
  scripts/               # validators/supervisors/self-improve
  state/                 # runtime state
  reports/               # evidence reports
  trend-harvest/         # external signal loop
  memories/              # durable context
  plugins/               # capability bundles/cache
  harness/               # loopy-era control plane
```

## 7. 기존 구조별 적용 방향

### Skills

Skill은 단순 사용법이 아니라 검증 가능한 task capsule이어야 한다. 우선 정렬 대상은 `autoresearch`, `self-improve`, `loopy-era-trend-harvester`, `start-harness`, `loopy-era-eval`, `ralph`, `ultraqa`, `team`이다.

### Rules

Rule은 Soft, Checkable, HARD 세 등급으로 관리한다. 반복 실패를 막고 자동 검증 가능한 규칙만 HARD로 승격한다.

### Agents / Prompts

Agent는 역할 경계를 지키는 실행 표면이다. `explore`, `researcher`, `dependency-expert`, `executor`, `verifier`, `security-reviewer`의 boundary를 명확히 한다.

### Plugins

Plugin은 capability surface다. cache 내부를 직접 수정하지 않고 config, skill trigger, verification contract에서 관리한다.

### MCP

MCP는 runtime substrate다. state, memory, code-intel, trace, wiki가 각각 lifecycle, 기억, 코드 근거, 실행 흐름, durable knowledge를 담당한다.

## 8. Adoption Plan

| Phase | 이름 | 상태 |
|---:|---|---|
| 1 | baseline-freeze-inventory | landed |
| 2 | requirements-control-plane-landing | landed |
| 3 | skill-application | planned |
| 4 | rule-hard-gate-application | planned |
| 5 | agent-prompt-application | planned |
| 6 | plugin-application | planned |
| 7 | mcp-application | planned |
| 8 | runtime-verification-expansion | planned |
| 9 | operating-loop | planned |

## 9. Verification Result

검증 파일:

```text
/Users/jung-wankim/.codex/harness/reports/completion-verification.json
```

결과:

```text
status: pass
checks: 53
failed: 0
prompt mappings: 8
```

검증 내용:

- 필수 파일 존재 및 non-empty 확인
- JSON manifest/inventory/verification parse 확인
- 모든 `REQ-*` ID가 manifest에 반영됐는지 확인
- adoption plan이 skill/rule/agent/plugin/MCP를 모두 포함하는지 확인
- structure 문서가 기존 user-scope 표면을 모두 포함하는지 확인
- runtime contract가 signal/scope/verification/ack/rollback을 포함하는지 확인
- prior autoresearch architect review가 `approved`인지 확인
- 사용자 프롬프트 요구사항이 산출물에 매핑되는지 확인

## 10. 남은 작업

현재 완료된 것은 요구사항, 구조, 계약, 적용 계획, 검증 감사다. 실제 runtime behavior 강화는 다음 단계다.

1. `completion-audit` validator를 독립 script로 만들기
2. `scope-classifier` CLI 만들기
3. critical skills audit script 만들기
4. MCP/plugin drift checker 만들기
5. loopy-era eval을 grep 기반에서 behavior scenario 기반으로 확장하기
6. soft rule → checkable rule → HARD gate 승격 자동화하기

## 11. Final Interpretation

당신이 원하는 시스템은 “내가 매번 지시하고 감시하는 시스템”이 아니다.

원하는 것은 다음이다.

> 내 철학과 반복 실수를 기억하고, 실패를 신호로 삼아 스스로 규칙/스킬/검증기를 갱신하며, 완료를 증거로만 선언하는 Codex user-scope 운영체계.

현재 그 운영체계의 기준점은 다음에 고정되어 있다.

```text
/Users/jung-wankim/.codex/harness
```

검증 결과:

```text
53 checks passed
0 failed
```