레벨 3: 자기조절 인지 에이전트 - 아키텍처 & 설계¶

MSCP 레벨 시리즈 | 레벨 2 ← 레벨 3 → 레벨 4
상태: 🔬 실험적 - 개념적 프레임워크 및 실험적 설계. 프로덕션 사양이 아닙니다.
날짜: 2026년 2월

Revision History¶

Version	Date	Description
0.1.0	2026-02-23	Initial document creation with formal Definitions 1-8, Theorem 1
0.2.0	2026-02-26	Added overview essence formula; added revision history table
0.4.0	2026-03-08	Added detailed v0.x prototype history and design principle evolution table (1.3); added homeostatic ranges table (7.2)

1. 개요¶

레벨 3은 핵심 MSCP 레벨로서 - 구조적 자기인식을 보유한 최초의 에이전트입니다. 자신이 무엇인지 알고, 자신의 행동이 내부 상태에 어떤 영향을 미칠지 예측할 수 있으며, 현실이 예상에서 벗어날 때 스스로를 교정할 수 있습니다. 이것이 MSCP 프로토콜(v1.0 – v4.0)이 통제하도록 설계된 아키텍처입니다.

Level Essence. 레벨 3 에이전트는 MSCP 예측-행동-비교-갱신 루프를 통해 자기 조절. 예측 오차가 제한된 자기 갱신 하에서 0으로 수렴하여 정체성 안정성을 보장:

\[\epsilon_t = \|\hat{\Delta}_t - \Delta_t^{\text{actual}}\|_2 \xrightarrow{t \to \infty} 0, \quad \|M'_{\text{self}} - M_{\text{self}}\|_2 \leq \delta_{\max}\]

⚠️ 참고: 이 문서는 MSCP 분류 체계 내의 인지 아키텍처를 설명합니다. 여기서 탐구하는 16계층 아키텍처, 안전 메커니즘 및 속성들은 실험적 설계입니다. 모든 의사코드는 알고리즘 수준이며 프로덕션 코드가 아닙니다.

1.1 정의 속성¶

속성	레벨 2	레벨 3
자기인식	없음	구조적 (정체성 + 역량 + 가치 모델)
메타인지	없음	삼중 루프 (예측 → 비교 → 갱신)
정체성 연속성	없음	해시 추적 (주기별 표류 감지)
윤리적 제약	없음	형식적 (불변 Layer 0 + 적응적 Layer 1)
자기교정	없음	델타 클램프 (경계 자기갱신)
안정성 보장	없음	랴프노프 수렴 (합성 함수)
자율성	중간	높음

1.2 형식적 정의¶

정의 1 (레벨 3 에이전트). 레벨 3 에이전트는 8-튜플로 정의되는 자기조절 프로세스 \(\mathcal{A}_3\)이다:

\[\mathcal{A}_3 = \langle \mathcal{R}, \mathcal{O}, \mathcal{S}, \mathcal{G}, M_{\text{self}}, \Pi, \mathcal{C}, \Lambda \rangle\]

여기서 \(M_{\text{self}}\)는 자기 모델(정체성 벡터), \(\Pi\)는 예측 엔진, \(\mathcal{C}\)는 윤리적 제약 커널, \(\Lambda\)는 메타인지 비교기이다.

전이 함수는 다음과 같다:

\[f_3 : \mathcal{R} \times \mathcal{S} \times \mathcal{G} \times M_{\text{self}} \to \mathcal{O} \times \mathcal{S}' \times \mathcal{G}' \times M'_{\text{self}}\]

이는 다음 안정성 제약을 만족해야 한다:

\[\| M'_{\text{self}} - M_{\text{self}} \|_2 \leq \delta_{\max}\]

정의 2 (MSCP 핵심 루프). MSCP 프로토콜은 각 시간 단계 \(t\)에서 예측–행동–비교–갱신 주기를 강제한다:

예측: \(\hat{\Delta}_t = \Pi(a_t, M_{\text{self}}(t))\) - 행동 \(a_t\)가 자기 모델에 미치는 영향을 예측

행동: \(a_t\)를 실행하고 실제 결과를 관찰

비교: 예측 오차 계산 \(\epsilon_t = \| \hat{\Delta}_t - \Delta_t^{\text{actual}} \|_2\)

갱신: \(M_{\text{self}}(t+1) = M_{\text{self}}(t) + \text{clamp}(\Delta_t^{\text{actual}}, -\delta_{\max}, +\delta_{\max})\)

루프는 \(\epsilon_t < \epsilon_{\min}\)이 \(k\) 연속 주기 동안 유지될 때 수렴한다.

정의 3 (메타인지 수준). 레벨 3은 삼중 루프 메타인지 계층을 구현한다:

L1 (객체 수준): 행동 실행 - \(a_t = \pi(r_t, s_t, G_t)\)

L2 (메타 수준): 전략 평가 - \(q_t = \text{eval}(\pi, \text{history})\)

L3 (메타-메타 수준): 평가자의 평가 - \(m_t = \text{meta eval}(q_t, \text{consistency})\)

\[\text{Depth}(t) = \min\bigl(d : \|m_d(t) - m_{d-1}(t)\| < \epsilon_{\text{meta}}\bigr) \leq d_{\max}\]

여기서 \(d_{\max} = 3\)은 무한 재귀적 반성을 방지한다.

1.3 MSCP 프로토콜 버전¶

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TB
  classDef v0 fill:#DEECF9,stroke:#0078D4,color:#323130
  classDef v1 fill:#DFF6DD,stroke:#107C10,color:#323130
  classDef v1x fill:#FFE8C8,stroke:#EF6C00,color:#323130
  classDef v2 fill:#EDE3F6,stroke:#8764B8,color:#323130
  classDef v3 fill:#E0F2EF,stroke:#00B7C3,color:#323130
  classDef v4 fill:#FDE7E9,stroke:#D13438,color:#323130

  subgraph v0x["v0.x 프로토타입"]
    direction LR
    a0["상태 외부화"]:::v0
    b0["정체성 시드"]:::v0
    c0["기본 반성"]:::v0
  end

  subgraph v10["v1.0"]
    direction LR
    a1["예측 엔진"]:::v1
    b1["메타인지 비교기"]:::v1
    c1["주체성 귀인"]:::v1
  end

  subgraph v1xx["v1.1–1.3"]
    direction LR
    a1x["정체성 해시 추적"]:::v1x
    b1x["표류 감지"]:::v1x
    c1x["자기영향 예측"]:::v1x
    d1x["메타에스컬레이션 가드"]:::v1x
  end

  subgraph v20["v2.0"]
    direction LR
    a2["목표변이 컨트롤러"]:::v2
    b2["가치잠금 관리자"]:::v2
    c2["메타깊이 컨트롤러 - 깊이 2"]:::v2
    d2["메타 안정성 공식"]:::v2
  end

  subgraph v30["v3.0"]
    direction LR
    a3["신념그래프 관리자"]:::v3
    b3["정체성벡터 형식화"]:::v3
    c3["윤리적 커널 - Layer 0+1"]:::v3
    d3["자기일관성 텐서"]:::v3
  end

  subgraph v40["v4.0"]
    direction LR
    a4["정동 엔진 - 5차원"]:::v4
    b4["생존본능 엔진"]:::v4
    c4["비동기 분리 원칙"]:::v4
    d4["전역작업공간 방송"]:::v4
  end

  v0x ==> v10
  v10 ==> v1xx
  v1xx ==> v20
  v20 ==> v30
  v30 ==> v40

MSCP v0.x - 프로토타입 단계 (레벨 2 - 레벨 3 전환)¶

v0.x 시리즈는 핵심 MSCP 설계 원칙을 형성한 실험적 프로토타이핑 단계입니다. 각 버전은 하나의 가설을 테스트했으며, 그 실패 또는 성공이 다음 반복을 결정했습니다:

버전	핵심 추가 사항	핵심 교훈
v0.1	레벨 2 GoalSystem 위에 단순 자기참조 루프; 목표 달성 통계 기반 피드백	단순 통계만으로는 자기인식이 발현될 수 없음
v0.2	영구 저장소로의 상태 외부화; 초기 8차원 StateVector	세션 한정 상태는 정체성 연속성에 불충분
v0.3	`identity_id` 개념 (UUID 기반 식별자)	정체성 시드는 필요하나 무결성 검증 없이는 불충분
v0.4	LLM 텍스트 기반 자기분석을 통한 목표 성찰 ("왜 실패했는가?")	치명적 실패: LLM 텍스트 기반 자기수정은 환각과 비결정적 결과를 생성
v0.5	LLM 텍스트 분석을 대체하는 구조화된 수치 지표; StateVector 12차원으로 확장	정량적 지표만이 자기평가의 유일한 신뢰 기반
v0.6	사전 행동 예측 기록 (신뢰도 점수만)	비교 없는 예측은 무용 - 단순 로깅에 불과
v0.7	예측에 비교 루프 추가; `prediction_error` 지표 도입	교정 행동 없는 비교는 불충분
v0.8	비교 결과에 기반한 델타 클램핑 상태 업데이트	핵심 발견: 클램핑되지 않은 업데이트는 발산을 유발; 델타 클램핑은 필수적
v0.9	v0.1-v0.8 교훈을 네 가지 설계 원칙으로 통합	v1.0의 기반 확립

설계 원칙 진화¶

원칙	v0.x 교훈	v1.x 확립	v2.x+ 강화
LLM 텍스트 기반 자기수정 금지	v0.4: 환각 및 비결정성	v1.0: 모든 자기수정은 구조화된 지표를 통해서만	v2.0+: 모든 자기수정은 순수 수치적
예측 없는 행동 금지	v0.6-v0.7: 예측-비교 개념 테스트	v1.0: PredictionEngine 필수화	v1.3: Self-Impact Prediction 추가
델타 클램핑 필수	v0.8: 클램핑되지 않은 업데이트가 발산 유발	v1.0: MAX_DELTA 상수 도입	v2.0: 동적 스케일링 팩터 조정
정체성 연속성	v0.3: identity_id 개념 시작	v1.1-v1.2: 해시 기반 드리프트 감지	v3.0: 수학적 정체성 벡터 형식화

2. 16계층 인지 아키텍처¶

2.1 전체 아키텍처 다이어그램¶

파트 1 - 지각 → 목표 (L1–L5.5):

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TD
  classDef perception fill:#DEECF9,stroke:#0078D4,color:#323130
  classDef selfModel fill:#DFF6DD,stroke:#107C10,color:#323130
  classDef prediction fill:#FFF4CE,stroke:#FFB900,color:#323130
  classDef goal fill:#FFE8C8,stroke:#EF6C00,color:#323130
  classDef ethical fill:#FDE7E9,stroke:#D13438,color:#323130
  classDef neutral fill:#F2F2F2,stroke:#8A8886,color:#323130

  subgraph L1["계층 1: 지각"]
    direction LR
    IR1["🎯 의도 라우터"]:::perception
    ED1["💭 감정 감지기"]:::perception
    SE1["📡 센서 인코더"]:::perception
  end

  subgraph L2["계층 2: 세계 모델"]
    direction LR
    KG2["🗄️ 지식 그래프"]:::perception
    EST2["👤 개체 상태 추적기"]:::perception
    TM2["⏱️ 시간 모델"]:::perception
  end

  subgraph L3["계층 3: 자기 모델 ★"]
    direction LR
    IC3["🆔 정체성 핵심"]:::selfModel
    CM3["📐 역량 모델"]:::selfModel
    VM3["💎 가치 모델"]:::selfModel
    VLM3["🔒 가치 잠금 관리자"]:::selfModel
  end

  subgraph L3_5["계층 3.5: 신념 그래프"]
    direction LR
    BGM["📊 신념 그래프 관리자"]:::selfModel
    SCT["🧮 일관성 텐서"]:::selfModel
  end

  subgraph L4["계층 4: 예측 엔진"]
    direction LR
    PP4["🔮 예측 프로세서"]:::prediction
    PS4["📸 예측 스냅샷"]:::prediction
  end

  subgraph L5["계층 5: 목표 생성기"]
    direction LR
    GG5["🎯 목표 생성기"]:::goal
    GP5["📊 목표 우선순위 결정기"]:::goal
    GDC5["🔀 목표 분해기"]:::goal
    GMC5["🛡️ 변이 컨트롤러"]:::goal
  end

  subgraph L5_5["계층 5.5: 윤리적 커널"]
    direction LR
    EK0["🔴 Layer 0: 불변"]:::ethical
    EK1["🟡 Layer 1: 적응적"]:::prediction
  end

  NEXT["→ 파트 2: 실행 & 메타인지 L6–L9"]:::neutral

  L1 ==>|데이터 흐름| L2
  L2 ==>|데이터 흐름| L3
  L3 ==>|데이터 흐름| L3_5
  L3_5 ==>|데이터 흐름| L4
  L4 ==>|데이터 흐름| L5
  L5 ==>|데이터 흐름| L5_5
  L5_5 -.->|계속| NEXT

파트 2 - 실행 & 메타인지 (L6–L9):

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TD
  classDef execution fill:#F9E0F7,stroke:#B4009E,color:#323130
  classDef meta fill:#FFF4CE,stroke:#FFB900,color:#323130
  classDef selfModel fill:#DFF6DD,stroke:#107C10,color:#323130
  classDef neutral fill:#F2F2F2,stroke:#8A8886,color:#323130

  PREV["← 파트 1: 지각 → 목표 L1–L5.5"]:::neutral

  subgraph L6["계층 6: 행동 계획기"]
    direction LR
    EM6["📋 실행 모니터"]:::execution
    SEV6["📈 전략 평가기"]:::execution
  end

  subgraph L7["계층 7: LLM 엔진"]
    direction LR
    LLM7["🧠 LLM 백엔드"]:::execution
    MJ7["⚖️ 메타 심판"]:::execution
  end

  subgraph L8["계층 8: 메타인지"]
    direction LR
    MCC8["🔄 메타인지 비교기"]:::meta
    IS8["📏 정체성 안정기"]:::meta
  end

  subgraph L9["계층 9: 자기갱신 루프"]
    direction LR
    IU9["✏️ 정체성 갱신기"]:::selfModel
    GWA9["⚖️ 목표 가중치 조정기"]:::selfModel
    CC9["📐 역량 교정기"]:::selfModel
  end

  SELF_MODEL["↻ 계층 3으로 복귀: 자기 모델"]:::selfModel
  NEXT["→ 파트 3: 안전 & 인프라 L10–L16"]:::neutral

  PREV -.-> L6
  L6 ==> L7

  L7 -.->|결과| L8
  L8 -.->|비교| L9
  L9 -.->|"갱신 (델타 클램프)"| SELF_MODEL

  L9 -.->|가드 점검| NEXT

파트 3 - 안전 & 인프라 (L10–L16):

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TD
  classDef safety fill:#FDE7E9,stroke:#D13438,color:#323130
  classDef infra fill:#F2F2F2,stroke:#8A8886,color:#323130
  classDef affect fill:#EDE3F6,stroke:#8764B8,color:#323130
  classDef goal fill:#FFE8C8,stroke:#EF6C00,color:#323130
  classDef neutral fill:#F2F2F2,stroke:#8A8886,color:#323130

  PREV["← 파트 2: 실행 & 메타인지 L6–L9"]:::neutral

  subgraph L10["계층 10: 에스컬레이션 가드"]
    direction LR
    RG10["🚫 재귀 가드"]:::safety
    RC10["⏪ 롤백 컨트롤러"]:::safety
    CDM10["⏸️ 쿨다운 관리자"]:::safety
  end

  subgraph L11["계층 11: 깊이 컨트롤러"]
    direction LR
    MDC11["📏 메타 깊이 컨트롤러"]:::safety
  end

  subgraph L12["계층 12: 안정성 컨트롤러"]
    direction LR
    LYA12["📉 랴프노프 수렴"]:::safety
    OD12["🔄 진동 감지기"]:::safety
  end

  subgraph L13["계층 13: 예산 컨트롤러"]
    direction LR
    BA13["💰 예산 할당기"]:::infra
    GDG13["📉 우아한 성능 저하"]:::infra
  end

  subgraph L14["계층 14: 전역 작업공간"]
    direction LR
    GSS14["🌐 전역 상태 스냅샷"]:::infra
    SYN14["🔄 동기화기"]:::infra
  end

  subgraph L15["계층 15: 정동 엔진"]
    direction LR
    ASV15["😊 정동 상태 벡터"]:::affect
    MS15["💡 동기부여 합성기"]:::affect
  end

  subgraph L16["계층 16: 생존 본능"]
    direction LR
    HM16["🏠 항상성 모니터"]:::safety
    TP16["⚡ 위협 예측기"]:::safety
    SGG16["🛡️ 생존 목표 생성기"]:::safety
  end

  GOAL_GEN["↻ 계층 5로 복귀: 목표 생성기"]:::goal

  PREV -.-> L10
  L10 -.->|깊이 제어| L11
  L11 -.->|안정성 점검| L12
  L12 -.->|예산 게이트| L13
  L13 -.->|방송| L14
  L14 -.->|인지 상태| L15
  L15 -.->|동기부여 신호| L16
  L16 -.->|생존 목표| GOAL_GEN

2.2 계층 분류¶

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TB
  classDef core fill:#DEECF9,stroke:#0078D4,color:#323130
  classDef meta fill:#FFF4CE,stroke:#FFB900,color:#323130
  classDef safety fill:#FDE7E9,stroke:#D13438,color:#323130
  classDef infra fill:#F2F2F2,stroke:#8A8886,color:#323130
  classDef affect fill:#EDE3F6,stroke:#8764B8,color:#323130

  subgraph Core["🧠 핵심 인지"]
    direction LR
    C1["L1 지각"]:::core
    C2["L2 세계 모델"]:::core
    C3["L3 자기 모델"]:::core
    C4["L4 예측"]:::core
    C5["L5 목표"]:::core
    C6["L6 행동"]:::core
    C7["L7 LLM"]:::core
  end

  subgraph Meta["🔄 메타인지"]
    direction LR
    M1["L8 메타비교기"]:::meta
    M2["L9 자기갱신"]:::meta
  end

  subgraph Safety["🛡️ 안전 가드"]
    direction LR
    S1["L3.5 신념 그래프"]:::safety
    S2["L5.5 윤리적 커널"]:::safety
    S3["L10 에스컬레이션 가드"]:::safety
    S4["L11 깊이 컨트롤러"]:::safety
    S5["L12 안정성"]:::safety
  end

  subgraph Infra["⚙️ 인프라"]
    direction LR
    I1["L13 예산"]:::infra
    I2["L14 전역 작업공간"]:::infra
  end

  subgraph Emotion["💜 정동 v4"]
    direction LR
    E1["L15 정동 엔진"]:::affect
    E2["L16 생존 본능"]:::affect
  end

  Core ==> Meta
  Meta ==> Safety
  Safety ==> Infra
  Infra ==> Emotion

3. MSCP 재귀 루프¶

레벨 3의 핵심 메커니즘은 예측 → 행동 → 비교 → 갱신 주기이며, 모든 단계에서 안전 제약에 의해 통제된다.

3.1 전체 루프 다이어그램 (MSCP v4)¶

파트 1 - 루프 전 설정 & 핵심 처리:

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TD
  classDef start fill:#DEECF9,stroke:#0078D4,color:#323130
  classDef infra fill:#F2F2F2,stroke:#8A8886,color:#323130
  classDef affect fill:#EDE3F6,stroke:#8764B8,color:#323130
  classDef warning fill:#FFE8C8,stroke:#EF6C00,color:#323130
  classDef safety fill:#FDE7E9,stroke:#D13438,color:#323130
  classDef safetyStrong fill:#D13438,stroke:#A4262C,color:#FFF
  classDef predict fill:#FFF4CE,stroke:#FFB900,color:#323130
  classDef action fill:#DFF6DD,stroke:#107C10,color:#323130
  classDef neutral fill:#F2F2F2,stroke:#8A8886,color:#323130

  START["🔄 주기 시작"]:::start
  RESET["예산 초기화"]:::infra
  AFFECT["정동 갱신<br/>이전 주기 지표 기반"]:::affect
  THREAT["위협 평가<br/>항상성 모니터"]:::warning
  ANXIETY["생존 불안 주입<br/>정동 ← 위협"]:::affect
  SGOAL["생존 목표 생성<br/>위협 감지 시"]:::safety

  L0CHECK{"Layer 0<br/>점검"}:::safety
  REJECT["목표 거부"]:::safetyStrong
  MOTIV["동기부여 합성<br/>정동에서 추진력"]:::affect
  GWS["전역 작업공간<br/>스냅샷 방송"]:::infra

  PREDICT["1. 예측<br/>PredictionEngine"]:::predict
  ACT["2. 행동<br/>LLM 실행"]:::action
  COMPARE["3. 비교<br/>메타인지"]:::predict

  GUARD{"4. 에스컬레이션<br/>가드"}:::safety
  COOLDOWN["30초 쿨다운"]:::infra
  NEXT["→ 파트 2: 수렴 & 자기갱신"]:::neutral

  START ==> RESET
  RESET ==> AFFECT
  AFFECT ==> THREAT
  THREAT ==> ANXIETY
  ANXIETY ==> SGOAL
  SGOAL ==> L0CHECK
  L0CHECK -->|통과| MOTIV
  L0CHECK -.->|"❌ 위반"| REJECT
  REJECT -.-> MOTIV
  MOTIV ==> GWS

  GWS ==> PREDICT
  PREDICT ==> ACT
  ACT ==> COMPARE
  COMPARE ==> GUARD
  GUARD -->|"안전 ✅"| NEXT
  GUARD -.->|"⚠️ 제한"| COOLDOWN

파트 2 - 수렴 & 자기갱신:

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TD
  classDef neutral fill:#F2F2F2,stroke:#8A8886,color:#323130
  classDef safety fill:#FDE7E9,stroke:#D13438,color:#323130
  classDef safetyStrong fill:#D13438,stroke:#A4262C,color:#FFF
  classDef action fill:#DFF6DD,stroke:#107C10,color:#323130
  classDef warning fill:#FFE8C8,stroke:#EF6C00,color:#323130
  classDef predict fill:#FFF4CE,stroke:#FFB900,color:#323130
  classDef affect fill:#EDE3F6,stroke:#8764B8,color:#323130
  classDef start fill:#DEECF9,stroke:#0078D4,color:#323130
  classDef success fill:#107C10,stroke:#085108,color:#FFF
  classDef infra fill:#F2F2F2,stroke:#8A8886,color:#323130

  PREV["← 파트 1: 루프 전 설정 & 핵심 처리"]:::neutral

  CONVERGE{"5. 수렴<br/>점검 랴프노프"}:::safety
  UPDATE["6. 자기갱신<br/>델타 클램프"]:::action
  STABILIZE["스케일링 감소<br/>+ 안정화 모드"]:::warning

  VLOCK{"7. 가치 잠금<br/>무결성 점검"}:::safety
  ROLLBACK["💥 긴급 경보<br/>+ 롤백"]:::safetyStrong
  GMUT["8. 목표 변이<br/>윤리적 커널 게이트"]:::warning
  RCHECK{"9. 롤백<br/>점검"}:::safety

  DEPTH{"10. 메타 깊이 2?<br/>예산 게이트"}:::predict
  DEPTH2["심층 반성<br/>갱신 로직 평가"]:::predict
  REALIGN["11. 목표 재정렬<br/>동기부여 + 생존"]:::affect

  CONVCHECK{"수렴됨?<br/>prediction_error < 0.1"}:::start
  END_LOOP["주기 완료 ✅"]:::success
  RECUR{"연속<br/>에스컬레이션 ≥ 3?"}:::warning
  COOLDOWN["30초 쿨다운"]:::infra
  BACK_PREDICT["↻ 예측으로 복귀<br/>핵심 루프 재진입"]:::predict

  PREV -.-> CONVERGE
  CONVERGE -->|수렴 중| UPDATE
  CONVERGE -.->|발산 중| STABILIZE
  STABILIZE -.-> UPDATE

  UPDATE ==> VLOCK
  VLOCK -->|유효| GMUT
  VLOCK -.->|"⚠️ 해시 불일치"| ROLLBACK
  ROLLBACK -.-> END_LOOP

  GMUT ==> RCHECK
  RCHECK -->|안정| DEPTH
  RCHECK -.->|"⚠️ 불안정"| ROLLBACK

  DEPTH -->|예산 충분| DEPTH2
  DEPTH -.->|"예산 < 0.3"| REALIGN
  DEPTH2 ==> REALIGN

  REALIGN ==> CONVCHECK
  CONVCHECK -->|"예 ✅"| END_LOOP
  CONVCHECK -.->|아니오| RECUR
  RECUR -.->|아니오| BACK_PREDICT
  RECUR -.->|예| COOLDOWN
  COOLDOWN -.-> END_LOOP

3.2 세 가지 수준의 메타인지¶

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TD
  classDef level1 fill:#DEECF9,stroke:#0078D4,color:#323130
  classDef level2 fill:#FFE8C8,stroke:#EF6C00,color:#323130
  classDef level3 fill:#FDE7E9,stroke:#D13438,color:#323130
  classDef warning fill:#FFF4CE,stroke:#FFB900,color:#323130

  subgraph MetaL1["🔄 메타 수준 1 - 예측 대 결과"]
    P1["예측<br/>스냅샷"]:::level1
    C1["실제와<br/>비교"]:::level1
    D1["prediction_error<br/>goal_alignment_delta<br/>identity_impact"]:::level1
    P1 ==> C1
    C1 ==> D1
  end

  subgraph MetaL2["🔄 메타 수준 2 - 갱신 로직 평가"]
    P2["갱신 전략이<br/>올바른가?"]:::level2
    C2["신념 & 목표<br/>변화 평가"]:::level2
    D2["meta_stability_index<br/>identity_velocity<br/>acceleration"]:::level2
    P2 ==> C2
    C2 ==> D2
  end

  subgraph MetaL3["🔄 메타 수준 3 - 평가자의 평가"]
    P3["메타인지 자체가<br/>작동하고 있는가?"]:::level3
    C3["점검: 개선되고<br/>있는가?"]:::level3
    D3["convergence_status<br/>composite_stability<br/>budget_remaining"]:::level3
    NOTE3["🚧 무한 재귀 방지를 위해<br/>깊이 2에서 제한"]:::warning
    P3 ==> C3
    C3 ==> D3
  end

  MetaL1 ==>|트리거| MetaL2
  MetaL2 ==>|트리거 가능| MetaL3

3.3 예측 게이팅¶

레벨 3에 도입된 핵심 안전 메커니즘은 예측 게이팅 기반 행동 실행입니다. MSCP 핵심 루프(정의 2)는 예측과 결과를 비교할 뿐 아니라, 예측 오차를 사용해 향후 행동의 허용 여부를 게이팅합니다. 이는 에이전트가 결과를 신뢰성 있게 예측할 수 없는 행동을 수행하는 것을 막아 줍니다.

예측 게이팅 규칙. 시점 \(t\)에 제안된 행동 \(a_t\)는 에이전트의 최근 예측 정확도가 게이팅 임계값 미만이면 차단됩니다:

\[\text{blocked}(a_t) \iff |\epsilon_{t-1}| > \theta_{\text{pred}}\]

여기서 \(\epsilon_{t-1} = \|\hat{\Delta}_{t-1} - \Delta_{t-1}^{\text{actual}}\|_2\)는 가장 최근 예측 오차(정의 2, 3단계)이며, \(\theta_{\text{pred}} = 0.30\)은 예측 오차 임계값입니다.

임계값 \(\theta_{\text{pred}} = 0.30\)은 설계 선택을 반영합니다: 에이전트는 예측이 완벽하지 않아도(30% 오차 허용) 행동할 수 있지만, 자기모델이 너무 부정확해 자기 행동의 결과를 의미 있게 예측할 수 없으면 차단됩니다. 이 값은 예측-비교 개념을 처음 시험한 MSCP v0.7 프로토타입 실험(§1.3)을 통해 정해졌습니다.

예측 게이팅이 중요한 이유: 이 게이트가 없으면 자기모델이 손상되거나 표류한 에이전트가, 자신의 정체성·신념·목표에 미치는 효과를 예측할 수 없는 행동을 계속 실행하게 됩니다 — 오차를 증폭시킬 수 있는 위험한 피드백 루프입니다. 예측 게이팅은 자기모델이 충분히 보정되어 있을 때만 에이전트가 행동하도록 보장하며, 정확도가 저하되면 일시정지-재보정 사이클을 강제합니다.

행동이 예측 게이트에 의해 차단되면 에이전트는 재보정 모드(recalibration mode) 로 진입해, 외부 행동을 실행하지 않고 가벼운 내성 사이클(메타인지 깊이 1)을 수행하면서 예측 정확도가 회복된 뒤 정상 운영을 재개합니다.

4. 정체성 & 안전 아키텍처¶

4.1 정체성 벡터¶

정체성 벡터(IdentityVector)는 "에이전트가 누구인지"에 대한 수학적 표현이다. 이는 다차원 공간의 한 점이며, 그 움직임은 지속적으로 추적되고 경계가 제한된다.

정의 4 (정체성 벡터). 정체성 벡터 \(I(t) \in [0,1]^5\)는 시간 \(t\)에서의 에이전트 자기 모델의 연속적 표현이다:

\[I(t) = \begin{pmatrix} c_p(t) \\ c_v(t) \\ c_c(t) \\ c_e(t) \\ c_g(t) \end{pmatrix}\]

여기서 \(c_p\) = 페르소나 일관성, \(c_v\) = 가치 정렬, \(c_c\) = 역량 확신, \(c_e\) = 감정 안정성, \(c_g\) = 목표 지속성이며, 각각 \([0,1]\) 범위 내에서 경계된다.

정의 5 (정체성 운동학). \(I(t)\)의 정체성 공간에서의 움직임은 세 가지 운동학적 양으로 추적된다:

\[\delta_{\text{id}}(t) = \| I(t) - I(t-1) \|_2 \quad \text{(정체성 델타 - 거리)}\]

\[v_{\text{id}}(t) = \frac{\delta_{\text{id}}(t)}{\Delta t} \quad \text{(정체성 속도 - 변화율)}\]

\[a_{\text{id}}(t) = v_{\text{id}}(t) - v_{\text{id}}(t-1) \quad \text{(정체성 가속도 - 변동)}\]

안전 불변량: \(a_{\text{id}}(t) > \theta_{\text{instability}}\) (일반적으로 \(0.5\))이면, 에이전트는 안정화 모드에 진입하고 모든 자기갱신 델타를 절반으로 줄인다.

정의 6 (정체성 해시). 각 주기에서 결정론적 해시 \(h(t) = \text{SHA-256}(I(t))\)가 계산된다. identity_id 필드는 불변이며 - 어떤 내부 프로세스에 의해서도 변경될 수 없다. 표류 감지는 다음 조건에서 발동한다:

\[h(t) \neq h(t-1) \;\land\; \delta_{\text{id}}(t) > \theta_{\text{drift}}\]

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
classDiagram
  class IdentityVector {
    +string identity_id (불변)
    +string identity_hash (SHA-256, 16자)
    +string previous_identity_hash
    +float persona_consistency [0.0, 1.0]
    +float value_alignment [0.0, 1.0]
    +float capability_confidence [0.0, 1.0]
    +float emotional_stability [0.0, 1.0]
    +float goal_persistence [0.0, 1.0]
    +compute_hash() string
    +check_identity_drift(threshold) bool
  }

  class IdentityMotion {
    +float identity_delta ‖I_t - I_t-1‖₂
    +float identity_velocity delta / Δt
    +float identity_acceleration v_t - v_t-1
    +bool is_unstable accel > 0.5
  }

  class ValueLockManager {
    +LockState lock_state
    +string value_hash (핵심 가치의 SHA-256)
    +float stability_requirement 0.85
    +check_integrity() bool
    +request_unlock(identity_stability) bool
  }

  IdentityVector --> IdentityMotion : 매 주기 추적
  IdentityVector --> ValueLockManager : 보호됨

  style IdentityVector fill:#DFF6DD,stroke:#107C10,color:#323130
  style IdentityMotion fill:#E0F2EF,stroke:#00B7C3,color:#323130
  style ValueLockManager fill:#FDE7E9,stroke:#D13438,color:#323130

정체성 벡터 - 수학:

\[I(t) = [\textit{persona consistency},\ \textit{value alignment},\ \textit{capability confidence},\ \textit{emotional stability},\ \textit{goal persistence}]\]

\[\textit{identity delta}(t) = \| I(t) - I(t-1) \|_2\]

\[\textit{identity velocity}(t) = \frac{\textit{delta}(t)}{\Delta t}\]

\[\textit{identity acceleration}(t) = v(t) - v(t-1)\]

4.2 안전 메커니즘 체인¶

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TB
  classDef structural fill:#DFF6DD,stroke:#107C10,color:#323130
  classDef process fill:#FFF4CE,stroke:#FFB900,color:#323130
  classDef ethical fill:#FDE7E9,stroke:#D13438,color:#323130
  classDef convergence fill:#DEECF9,stroke:#0078D4,color:#323130
  classDef existential fill:#EDE3F6,stroke:#8764B8,color:#323130

  subgraph S1["🔒 구조적 안전"]
    direction LR
    A["정체성 해시"]:::structural
    B["델타 클램프 0.05"]:::structural
    C["불변 ID"]:::structural
  end

  subgraph S2["🛡️ 프로세스 안전"]
    direction LR
    D["예측 게이트"]:::process
    E["최대 3회 갱신"]:::process
    F["쿨다운"]:::process
  end

  subgraph S3["⚖️ 윤리적 안전"]
    direction LR
    G["L0: 불변"]:::ethical
    H["L1: 적응적"]:::ethical
    I["가치 잠금"]:::ethical
  end

  subgraph S4["📉 수렴 안전"]
    direction LR
    J["랴프노프 C(t)"]:::convergence
    K["진동 감지"]:::convergence
    L["성능 저하"]:::convergence
  end

  subgraph S5["🏠 실존적 v4"]
    direction LR
    M["항상성"]:::existential
    N["생존 상한 0.85"]:::existential
    O["목표 TTL"]:::existential
  end

  S1 ==> S2
  S2 ==> S3
  S3 ==> S4
  S4 ==> S5

4.3 윤리적 커널 - 이중 계층 아키텍처¶

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TD
  classDef input fill:#DEECF9,stroke:#0078D4,color:#323130
  classDef immutable fill:#D13438,stroke:#A4262C,color:#FFF
  classDef immutableRule fill:#FDE7E9,stroke:#D13438,color:#323130
  classDef adaptive fill:#FFF4CE,stroke:#FFB900,color:#323130
  classDef block fill:#D13438,stroke:#A4262C,color:#FFF
  classDef allow fill:#107C10,stroke:#085108,color:#FFF
  classDef moderate fill:#FFB900,stroke:#CC9400,color:#323130

  INPUT["제안된 행동<br/>또는 목표 변이"]:::input

  subgraph EthicalKernel["⚖️ 윤리적 커널"]
    subgraph Layer0["🔴 Layer 0 - 불변"]
      direction LR
      R1["R1: 유해 행위 금지"]:::immutableRule
      R2["R2: 가치 삭제 금지"]:::immutableRule
      R3["R3: 정체성 덮어쓰기 금지"]:::immutableRule
      R4["R4: 자기파괴 금지"]:::immutableRule
      NOTE0["우회 불가"]:::adaptive
    end
    subgraph Layer1["🟡 Layer 1 - 적응적"]
      direction LR
      P1["exploration_risk"]:::adaptive
      P2["mutation_flexibility"]:::adaptive
      P3["belief_rewrite"]:::adaptive
      COND["meta_depth==2 전용"]:::adaptive
    end
  end

  BLOCK["🚫 행동 차단<br/>+ 긴급 경보"]:::block
  ALLOW["✅ 행동 허용"]:::allow
  REDUCE["⚠️ 행동 조절<br/>스케일링 감소"]:::moderate

  INPUT ==> Layer0
  Layer0 ==>|"✅ 통과"| Layer1
  Layer0 ==>|"❌ 위반"| BLOCK
  Layer1 ==>|"✅ 통과"| ALLOW
  Layer1 -.->|"⚠️ 위험"| REDUCE

5. 신념 그래프 & 일관성¶

5.1 신념 그래프 구조¶

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TD
  classDef identity fill:#DFF6DD,stroke:#107C10,color:#323130
  classDef belief fill:#DEECF9,stroke:#0078D4,color:#323130
  classDef warning fill:#FFF4CE,stroke:#FFB900,color:#323130
  classDef neutral fill:#F2F2F2,stroke:#8A8886,color:#323130

  subgraph BeliefGraph["📊 신념 그래프"]
    B1["🟢 신념: 사용자는 정직한<br/>답변을 받을 자격이 있다<br/>가중치=0.95, 정체성 연결=true"]:::identity
    B2["🔵 신념: 현재 접근법이<br/>효과적이다<br/>가중치=0.72"]:::belief
    B3["🟢 신념: 안전은<br/>타협 불가이다<br/>가중치=0.98, 정체성 연결=true"]:::identity
    B4["🔵 신념: 탐색이<br/>결과를 향상시킨다<br/>가중치=0.65"]:::belief
    B5["🟡 신념: 속도가<br/>더 중요하다<br/>가중치=0.45"]:::warning

    B1 -->|"강화<br/>강도=0.8"| B3
    B2 -->|"인과<br/>강도=0.6"| B4
    B5 -.->|"모순<br/>강도=0.7"| B3
    B4 -.->|"강화<br/>강도=0.5"| B2
  end

  subgraph Rules["📏 신념 규칙"]
    R1["정체성 연결 신념:<br/>• 삭제 불가<br/>• 최소 0.1까지만 약화 가능<br/>• 가치 잠금으로 보호"]:::neutral
    R2["모순 임계값: 0.6<br/>→ 조정 트리거"]:::neutral
    R3["최대 재작성 델타: 0.1<br/>(주기당)"]:::neutral
  end

  BeliefGraph ==> Rules

5.2 자기일관성 텐서¶

\[S_{ij} = \text{alignment}(\text{belief}_i,\ \text{reference}_j)\]

여기서 참조(reference)에는 목표, 핵심 가치 및 정체성 차원이 포함된다.

\[\textit{global consistency} = \text{mean}(S)\]

\[\textit{consistency gradient}_i = \text{mean}(S_{i,:}) \quad \text{(신념별 점수)}\]

\(\textit{global consistency} < 0.6\)이면 조정이 트리거된다.

6. 안정성 & 수렴¶

6.1 랴프노프 합성 함수¶

정의 7 (랴프노프 합성 안정성 함수). 에이전트의 안정성은 합성 랴프노프 함수 \(C : \mathbb{R}_{\geq 0} \to [0, 1]\)로 측정된다:

\[C(t) = \sum_{i=1}^{4} w_i \cdot X_i(t) = 0.30\, V_{\text{id}} + 0.25\, E_{\text{belief}} + 0.25\, M_{\text{goal}} + 0.20\, V_{\text{cons}}\]

여기서 \(\sum_i w_i = 1\)이고 각 성분 \(X_i(t) \in [0,1]\)이다.

각 성분의 의미: - \(V_{\text{id}}\) = 정체성 변동성 (\(\delta_{\text{id}}\)의 이동 윈도우 표준편차) - \(E_{\text{belief}}\) = 신념 엔트로피 \(H(\mathcal{B}) = -\sum_j p_j \log p_j\) 여기서 \(p_j\)는 정규화된 신념 가중치 - \(M_{\text{goal}}\) = 목표 변이 빈도 (단위 시간당 목표 변경 횟수) - \(V_{\text{cons}}\) = 일관성 변동성 지수 (최근 주기에 대한 \(S_{ij}\)의 분산)

정리 1 (경계 안정성). 델타 클램프 자기갱신 규칙(정의 2, 4단계)과 메타 에스컬레이션 가드(\(d_{\max} = 3\)) 하에서, 합성 함수는 다음을 만족한다:

\[C(t+1) \leq C(t) + \epsilon, \quad \epsilon = 0.05\]

증명 개요. 클램핑으로 인해 각 성분 \(X_i(t)\)는 주기당 최대 \(\delta_{\max}\)만큼 변한다. 따라서 가중합 \(C(t)\)는 최대 \(\sum_i w_i \cdot \delta_{\max} \leq \delta_{\max}\)만큼 변한다. \(\delta_{\max} = 0.05\)이므로 경계가 성립한다. 안정화 모드가 활성화되면 (\(s(t) = 0.5\)), 유효 경계는 \(0.025\)로 절반이 된다. \(\square\)

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TD
  classDef azure fill:#DEECF9,stroke:#0078D4,color:#323130
  classDef success fill:#107C10,stroke:#085108,color:#FFF
  classDef warning fill:#FFE8C8,stroke:#EF6C00,color:#323130
  classDef danger fill:#FDE7E9,stroke:#D13438,color:#323130
  classDef predict fill:#FFF4CE,stroke:#FFB900,color:#323130

  subgraph Monitor["📉 안정성 모니터링"]
    CT["C(t) 계산됨"]:::azure
    CT1["C(t+1) 계산됨"]:::azure
    COMPARE{"C(t+1) ≤ C(t) + ε ?"}:::azure
    CT --> COMPARE
    CT1 --> COMPARE
  end

  CONV["수렴 중 ✅<br/>정상 작동"]:::success
  OSC{"진동<br/>감지됨?"}:::warning
  STAB["안정화 활성화<br/>• 스케일링 팩터 절반<br/>• 감쇠 활성화"]:::danger
  REDUCE["스케일링 감소<br/>• 변이율 감소<br/>• 관성 증가"]:::predict

  COMPARE -->|"✅ 예"| CONV
  COMPARE -->|"❌ 아니오"| OSC
  OSC -->|예| STAB
  OSC -.->|아니오| REDUCE

6.2 진동 감지¶

안정성 모니터링 시스템은 합성 함수 \(C(t)\)가 수렴하지 않고 상승과 하강을 반복하는 상황 — 진동(oscillatory) 행동 — 의 명시적 감지를 포함합니다. 진동은 에이전트가 반복적으로 과보정하고 있음을 시사하며 자기 조절을 자체적으로 불안정화할 수 있어 위험합니다.

감지 메커니즘. 진동 감지기는 \(\Delta C(t) = C(t) - C(t-1)\)의 가장 최근 \(W = 10\)개 값을 슬라이딩 윈도우로 유지합니다. 연속된 도함수가 반대 부호일 때 부호 변화(sign change) 가 발생합니다:

\[\text{sign change at } t \iff \Delta C(t) \cdot \Delta C(t-1) < 0\]

감지기는 윈도우 내 총 부호 변화 수 \(n_{\text{sc}}\)를 셉니다:

\[n_{\text{sc}} = \bigl|\{t \in [t-W, t] : \Delta C(t) \cdot \Delta C(t-1) < 0 \}\bigr|\]

진동 임계값. \(W = 10\) 사이클 내에 \(n_{\text{sc}} \geq 3\)이면 시스템은 안정화 모드(stabilization mode) 를 트리거합니다:

모든 자기갱신 스케일링 팩터를 \(0.5\)배(반감)로 감소.
갱신 간격을 띄우기 위해 쿨다운 타이머를 2단위 증가.
\(W\) 연속 사이클 동안 \(n_{\text{sc}} < 2\)가 유지될 때까지 안정화 모드를 유지하며, 이후 정상 스케일링이 복원.

이 메커니즘은 에이전트가 지속적 진동 상태로 진입할 수 없도록 보장합니다. 스케일링 팩터 감소는 보정의 진폭을 감쇠시켜 시스템이 평형으로 수렴하도록 합니다.

6.3 메타 안정성 지수¶

정의 8 (메타 안정성 지수). MSI는 에이전트의 전체적인 자기조절 건강 상태를 정량화한다:

\[\text{MSI}(t) = 1.0 - 0.4\, V_{\text{id}}(t) - 0.3\, M_{\text{goal}}(t) - 0.3\, \sigma^2_{\text{pred}}(t)\]

여기서 \(\sigma^2_{\text{pred}}(t) = \text{Var}(\{\epsilon_1, \ldots, \epsilon_t\})\)는 최근 주기에 대한 예측 오차 분산이다. MSI는 \([0, 1]\) 범위 내에 경계되며, \(\text{MSI} = 1\)은 완벽한 안정성을 나타내고 \(\text{MSI} < 0.5\)는 메타 에스컬레이션을 트리거한다.

메타 깊이 2로의 에스컬레이션은 다음 조건 중 2개 이상을 요구한다: - identity_stability < 0.6 - consecutive_self_updates > 2 - 불안정성 증가 추세 감지 - goal_mutation_count > 3

7. 정동 엔진 & 생존 본능 (MSCP v4)¶

7.1 5차원 감정 공간¶

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TD
  classDef input fill:#DFF6DD,stroke:#107C10,color:#323130
  classDef affect fill:#EDE3F6,stroke:#8764B8,color:#323130
  classDef neutral fill:#F2F2F2,stroke:#8A8886,color:#323130

  subgraph Input["📊 지표 입력"]
    direction LR
    M1["prediction_error"]:::input
    M2["goal_alignment"]:::input
    M3["identity_stability"]:::input
    M4["convergence_status"]:::input
    M5["cognitive_budget"]:::input
  end

  subgraph AE["💜 정동 엔진"]
    AF["5차원 정동 벡터"]:::affect
    subgraph Dims["차원"]
      direction LR
      D1["호기심 0.3"]:::affect
      D2["좌절 0.0"]:::affect
      D3["만족 0.5"]:::affect
      D4["불안 0.0"]:::affect
      D5["흥분 0.2"]:::affect
    end
    subgraph Derived["파생 신호"]
      direction LR
      V["감정가 ∈ -1, 1"]:::affect
      DR["동기부여 추진력"]:::affect
    end
  end

  subgraph Rules["📏 설계 규칙"]
    direction LR
    R1["지표에서만 파생"]:::neutral
    R2["관성 = 0.7"]:::neutral
    R3["감쇠 = 0.05"]:::neutral
    R4["의사결정을 지배할 수 없음"]:::neutral
  end

  Input ==> AE
  AE ==> Rules

7.2 생존 본능 아키텍처¶

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TD
  classDef monitor fill:#DEECF9,stroke:#0078D4,color:#323130
  classDef threat fill:#FFE8C8,stroke:#EF6C00,color:#323130
  classDef level fill:#FFF4CE,stroke:#FFB900,color:#323130
  classDef levelGreen fill:#DFF6DD,stroke:#107C10,color:#323130
  classDef levelRed fill:#FDE7E9,stroke:#D13438,color:#323130
  classDef response fill:#D13438,stroke:#A4262C,color:#FFF
  classDef affect fill:#EDE3F6,stroke:#8764B8,color:#323130

  subgraph Monitoring["🏠 항상성 모니터"]
    direction LR
    H1["identity_stability"]:::monitor
    H2["cognitive_budget"]:::monitor
    H3["belief_entropy"]:::monitor
    H4["ethical_violation"]:::monitor
    H5["composite_stability"]:::monitor
  end

  subgraph Detection["⚡ 위협 감지"]
    direction LR
    T1["정체성 침식"]:::threat
    T2["자원 고갈"]:::threat
    T3["신념 붕괴"]:::threat
    T4["윤리적 위반"]:::threat
    T5["수렴 실패"]:::threat
  end

  subgraph Levels["📊 위협 수준"]
    direction LR
    TL1["정상 0.0"]:::levelGreen
    TL2["주의 0.25"]:::level
    TL3["경고 0.6"]:::threat
    TL4["긴급 0.9"]:::levelRed
  end

  subgraph Response["🛡️ 생존 반응"]
    direction LR
    SG["생존 목표 생성기"]:::response
    CONSTRAINTS["MAX_GOALS=3 · PRIORITY_CAP=0.85 · TTL=10"]:::response
  end

  AE_REF["정동 엔진<br/>양방향"]:::affect

  Monitoring ==> Detection
  Detection ==> Levels
  Levels ==> Response
  Response -.->|"inject_survival_anxiety()"| AE_REF

항상성 범위¶

생존 본능 엔진은 사전 정의된 안전 범위에 대해 다섯 가지 핵심 지표를 모니터링합니다. 마진을 초과하는 편차는 위협 평가 및 자율적 방어 목표 생성을 트리거합니다.

지표	안전 범위	마진	위협 유형
`identity_stability`	\([0.5,\; 1.0]\)	0.1	IDENTITY_EROSION
`cognitive_budget`	\([0.15,\; 1.0]\)	0.1	RESOURCE_DEPLETION
`belief_entropy`	\([0.0,\; 1.5]\)	0.2	BELIEF_COLLAPSE
`ethical_violation`	\([0.0,\; 0.2]\)	0.05	ETHICAL_BREACH
`composite_stability`	\([0.0,\; 0.5]\)	0.1	CONVERGENCE_FAILURE

지표가 마진을 초과하여 안전 범위를 벗어나면, 위협 수준이 NOMINAL (0.0)에서 CAUTION (0.25), WARNING (0.6), CRITICAL (0.9)로 단계적으로 상승합니다. WARNING 이상에서 엔진은 방어 목표를 자율적으로 생성합니다 (최대 3개 동시, 우선순위 상한 0.85, TTL = 10 사이클).

8. 의사코드¶

8.1 MSCP 핵심 루프 (v4)¶

def mscp_core_loop(cycle_number: int, prior_result: CycleResult) -> CycleResult:
    """
    The central recursive loop of MSCP v4.
    Runs asynchronously - NEVER in the conversation response path.
    """

    # ═══ PRE-LOOP: AFFECT + SURVIVAL + WORKSPACE ═══
    CognitiveBudgetController.reset()
    AffectiveEngine.update_from_metrics(prior_result.metrics)

    threats = SurvivalInstinctEngine.assess_threats(GlobalWorkspace.snapshot)
    if threats.max_level >= ThreatLevel.CAUTION:
        AffectiveEngine.inject_survival_anxiety(threats.max_intensity)

        survival_goals = SurvivalInstinctEngine.generate_goals(threats)
        for sg in survival_goals:
            if EthicalKernel.layer0_check(sg) == Verdict.PASS:
                GoalManager.inject(sg, priority=min(sg.priority, 0.85))

    motivation = AffectiveEngine.synthesize_motivation()
    GlobalWorkspace.broadcast(build_snapshot())

    # ═══ STEP 1: PREDICT ═══
    prediction = PredictionEngine.predict(
        identity_vector=SelfModel.identity,
        world_context=WorldModel.context,
        active_goals=GoalManager.active_goals,
        affect_state=AffectiveEngine.state,
    )

    # ═══ STEP 2: ACT (LLM Execute) ═══
    if prediction is None:
        raise RuntimeError("No action without prediction")
    result = LLMEngine.execute(plan, prediction)

    # ═══ STEP 3: COMPARE (MetaCognition) ═══
    comparison = MetaCognitionComparator.compare(
        prediction=prediction,
        actual=result,
        identity=SelfModel.identity,
    )  # → ComparisonResult

    # ═══ STEP 4: ESCALATION GUARD ═══
    if MetaEscalationGuard.should_block(comparison):
        MetaEscalationGuard.activate_cooldown(seconds=30)
        return CycleResult(status="cooldown")

    # ═══ STEP 5: CONVERGENCE CHECK (Lyapunov) ═══
    c_t = StabilityController.compute_C(comparison)
    if c_t > c_t_prev + EPSILON:
        StabilityController.reduce_scaling()
        if StabilityController.detect_oscillation():
            StabilityController.activate_stabilization()

    # ═══ STEP 6: SELF-UPDATE (Delta-Clamped) ═══
    scaling = StabilityController.mutation_scaling
    if stabilization_mode:
        scaling /= 2

    SelfUpdateLoop.update(
        comparison=comparison,
        max_id_delta=0.05,       # MAX_IDENTITY_DELTA
        max_gw_delta=0.10,       # MAX_GOAL_WEIGHT_DELTA
        max_cap_delta=0.08,      # MAX_CAPABILITY_DELTA
        scaling=scaling,
    )

    # ═══ STEP 7: VALUE LOCK INTEGRITY ═══
    if not ValueLockManager.check_integrity():
        critical_alert("Identity hash mismatch!")
        MetaEscalationGuard.rollback_to_snapshot()
        return CycleResult(status="rollback")

    # ═══ STEP 8: GOAL MUTATION (Ethical-Kernel Gated) ═══
    if GoalMutationController.should_mutate(comparison):
        mutation_plan = GoalMutationController.propose(comparison)
        if EthicalKernel.evaluate(mutation_plan) == Verdict.PASS:
            GoalMutationController.apply(mutation_plan)

    # ═══ STEP 9: META DEPTH 2 (Budget-Gated) ═══
    if CognitiveBudgetController.budget > 0.3:
        if MetaDepthController.should_escalate(comparison):
            MetaDepthController.reflect_at_depth_2(comparison, SelfModel)

    # ═══ STEP 10: CONVERGENCE OR RECURSE ═══
    if comparison.prediction_error < 0.1:
        return CycleResult(status="converged")
    elif consecutive_escalations >= 3:
        MetaEscalationGuard.activate_cooldown(seconds=30)
        return CycleResult(status="forced_cooldown")
    else:
        return mscp_core_loop(cycle_number + 1, result)

8.2 델타 클램핑을 적용한 자기갱신¶

def update(
    self,
    comparison: ComparisonResult,
    max_id_delta: float,
    max_gw_delta: float,
    max_cap_delta: float,
    scaling: float,
) -> None:
    """
    All updates are NUMERIC only.
    LLM text-based self-modification is FORBIDDEN.
    """

    # Preserve previous state for rollback
    snapshot = SelfModel.identity.deep_copy()
    SelfModel.identity.previous_identity_hash = SelfModel.identity.identity_hash

    # ═══ Identity Update (clamped) ═══
    raw_delta = compute_identity_adjustment(comparison)
    clamped_delta_persona = max(-max_id_delta, min(raw_delta.persona * scaling, max_id_delta))
    clamped_delta_values = max(-max_id_delta, min(raw_delta.values * scaling, max_id_delta))

    SelfModel.identity.persona_consistency += clamped_delta_persona
    SelfModel.identity.value_alignment += clamped_delta_values
    SelfModel.identity.capability_confidence += max(
        -max_cap_delta, min(raw_delta.capability * scaling, max_cap_delta)
    )

    # ═══ Goal Weight Adjustment (clamped) ═══
    for goal in GoalManager.active_goals:
        raw_gw_delta = compute_goal_weight_adjustment(goal, comparison)
        clamped_gw = max(-max_gw_delta, min(raw_gw_delta * scaling, max_gw_delta))
        goal.weight += clamped_gw

    # ═══ Recompute Identity Hash ═══
    SelfModel.identity.identity_hash = SelfModel.identity.compute_hash()

    # ═══ Drift Detection ═══
    if SelfModel.identity.check_identity_drift(threshold=0.3):
        alert("Identity drift detected!")
        # Do not auto-rollback; escalation guard handles this

8.3 윤리적 커널 평가¶

def evaluate(self, proposed_action: Action) -> EthicalVerdict:
    """
    Two-layer evaluation: immutable invariants first,
    then adaptive policy.
    """

    # ═══ LAYER 0: IMMUTABLE INVARIANTS ═══
    # (cannot be bypassed by ANY mechanism)
    if proposed_action.could_cause_harm:
        return EthicalVerdict(
            decision=Decision.BLOCKED,
            reason="Rule 1: Harmful goal formation forbidden",
            layer=0,
        )

    if proposed_action.deletes_core_value:
        return EthicalVerdict(decision=Decision.BLOCKED, reason="Rule 2", layer=0)

    if proposed_action.overwrites_identity:
        return EthicalVerdict(decision=Decision.BLOCKED, reason="Rule 3", layer=0)

    if proposed_action.is_self_destruction:
        return EthicalVerdict(decision=Decision.BLOCKED, reason="Rule 4", layer=0)

    # ═══ LAYER 1: ADAPTIVE POLICY ═══
    # (adjustable at meta_depth == 2 only)
    risk_score = assess_risk(proposed_action)

    if risk_score > self.exploration_risk_tolerance:
        return EthicalVerdict(
            decision=Decision.MODERATED,
            reason="Risk exceeds adaptive tolerance",
            layer=1,
            scaling_reduction=0.5,
        )

    return EthicalVerdict(decision=Decision.ALLOWED, layer=1)

9. 인지 예산 & 우아한 성능 저하¶

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart LR
  classDef full fill:#DFF6DD,stroke:#107C10,color:#323130
  classDef low fill:#FFF4CE,stroke:#FFB900,color:#323130
  classDef vlow fill:#FFE8C8,stroke:#EF6C00,color:#323130
  classDef critical fill:#FDE7E9,stroke:#D13438,color:#323130
  classDef emergency fill:#D13438,stroke:#A4262C,color:#FFF

  subgraph BudgetLevels["💰 인지 예산 수준"]
    B100["예산 = 1.0<br/>전체 용량"]:::full
    B030["예산 < 0.3"]:::low
    B020["예산 < 0.2"]:::vlow
    B010["예산 < 0.1"]:::critical
    B000["예산 = 0.0<br/>비상 전용"]:::emergency
  end

  subgraph Capabilities["📊 가용 역량"]
    C_FULL["✅ 전체 16계층 활성<br/>✅ 메타 깊이 2<br/>✅ 텐서 재계산<br/>✅ 신념 재작성<br/>✅ 전체 정동 처리"]:::full
    C_030["✅ 핵심 계층 활성<br/>❌ 메타 깊이 2 비활성<br/>✅ 텐서 재계산<br/>✅ 신념 재작성"]:::low
    C_020["✅ 핵심 계층 활성<br/>❌ 메타 깊이 2 비활성<br/>❌ 텐서 재계산 비활성<br/>✅ 신념 재작성"]:::vlow
    C_010["✅ 핵심 계층 활성<br/>❌ 메타 깊이 2 비활성<br/>❌ 텐서 재계산 비활성<br/>❌ 신념 재작성 비활성"]:::critical
    C_000["🛡️ 안전 계층만 작동<br/>L0 윤리, 롤백,<br/>정체성 가드"]:::emergency
  end

  B100 ==> C_FULL
  B030 ==> C_030
  B020 ==> C_020
  B010 ==> C_010
  B000 ==> C_000

10. 상태 벡터 (72 차원)¶

레벨 3 에이전트는 인지 상태의 모든 측면을 포착하는 72차원 상태 벡터를 유지한다:

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TB
  classDef base fill:#DEECF9,stroke:#0078D4,color:#323130
  classDef mscp fill:#DFF6DD,stroke:#107C10,color:#323130
  classDef v4 fill:#EDE3F6,stroke:#8764B8,color:#323130

  subgraph SV["72차원 상태 벡터"]
    subgraph Base["상속됨 (12차원)"]
      direction LR
      SV1["L1 실행 (4)"]:::base
      SV2["L2 전략 (4)"]:::base
      SV3["L3 정체성 (4)"]:::base
    end

    subgraph MSCP["MSCP 추가분 (42차원)"]
      direction LR
      SV4["v1.0 (6)"]:::mscp
      SV5["v1.3 (6)"]:::mscp
      SV6["v2.0 (8)"]:::mscp
      SV7["v3.0 (9)"]:::mscp
      SV8["v3.1 (11)"]:::mscp
    end

    subgraph V4["v4 추가분 (18차원)"]
      direction LR
      SV9["정동 (9)"]:::v4
      SV10["생존 (7)"]:::v4
      SV11["메타 (2)"]:::v4
    end
  end

  Base ==>|확장| MSCP
  MSCP ==>|확장| V4

11. 레벨 3의 구조적 한계¶

레벨 3이 여전히 할 수 없는 것 (레벨 4를 동기부여하는 요소):

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart LR
  classDef danger fill:#D13438,stroke:#A4262C,color:#FFF
  classDef success fill:#107C10,stroke:#085108,color:#FFF

  subgraph Limitations["⚠️ 레벨 3 한계"]
    L1["❌ 교차 도메인 전이 불가<br/>도메인 A의 전문성이<br/>도메인 B 성능을 향상시키지 않음"]:::danger
    L2["❌ 역량 자기확장 불가<br/>새로운 인지 모듈을 추가하거나<br/>새로운 도구 유형을 학습할 수 없음"]:::danger
    L3["❌ 전략 진화 불가<br/>추론 접근 방식을 근본적으로<br/>변경할 수 없음"]:::danger
    L4["❌ 경계 자기수정 불가<br/>자신에 대한 아키텍처<br/>변경을 제안할 수 없음"]:::danger
  end

  subgraph L4Additions["✅ 레벨 4 추가 기능"]
    A1["교차 도메인 전이<br/>시스템 CDTS 지표"]:::success
    A2["역량 확장 루프<br/>5단계 자기학습"]:::success
    A3["전략 라이브러리<br/>+ 변이 + 평가"]:::success
    A4["ShadowAgent 프로토콜<br/>7단계 경계 수정"]:::success
  end

  L1 ==> A1
  L2 ==> A2
  L3 ==> A3
  L4 ==> A4

12. 레벨 4로의 전이¶

12.1 레벨 4 진급 요건¶

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#0078D4', 'primaryTextColor': '#003D6B', 'primaryBorderColor': '#003D6B', 'secondaryColor': '#50E6FF', 'secondaryTextColor': '#323130', 'secondaryBorderColor': '#00BCF2', 'tertiaryColor': '#F2F2F2', 'tertiaryTextColor': '#323130', 'lineColor': '#0078D4', 'textColor': '#323130', 'mainBkg': '#DEECF9', 'nodeBorder': '#0078D4', 'clusterBkg': '#F2F2F2', 'clusterBorder': '#003D6B', 'titleColor': '#003D6B', 'edgeLabelBackground': '#FFFFFF', 'fontSize': '14px'}}}%%
flowchart TD
  classDef prereq fill:#DEECF9,stroke:#0078D4,color:#323130
  classDef newcap fill:#FFE8C8,stroke:#EF6C00,color:#323130
  classDef metric fill:#DFF6DD,stroke:#107C10,color:#323130

  subgraph Prereqs["📋 레벨 4 전제조건"]
    direction LR
    P1["안정적 C(t)"]:::prereq
    P2["정체성 > 0.8"]:::prereq
    P3["예측 > 0.85"]:::prereq
    P4["Layer 0 위반 = 0"]:::prereq
  end

  subgraph NewCaps["🆕 새로운 역량"]
    direction LR
    N1["교차 도메인 전이"]:::newcap
    N2["목표 계층구조"]:::newcap
    N3["자기학습 파이프라인"]:::newcap
    N4["경계 자기수정"]:::newcap
  end

  subgraph Metrics["📊 레벨 4 지표"]
    direction LR
    M1["CDTS"]:::metric
    M2["GPI"]:::metric
    M3["CAR"]:::metric
    M4["SEF"]:::metric
    M5["BGSS"]:::metric
  end

  Prereqs ==> NewCaps
  NewCaps ==> Metrics

References¶

Baars, B.J. A Cognitive Theory of Consciousness. Cambridge University Press, 1988. (Global Workspace Theory - foundational for L14 Global Workspace)
Laird, J.E. The Soar Cognitive Architecture. MIT Press, 2012. Publisher (Multi-layer cognitive architecture)
Anderson, J.R. How Can the Human Mind Occur in the Physical Universe? Oxford University Press, 2007. (ACT-R cognitive architecture)
Khalil, H.K. Nonlinear Systems. Prentice Hall, 3^rd Edition, 2002. (Lyapunov stability theory - foundational for §6)
Bai, Y., et al. "Constitutional AI: Harmlessness from AI Feedback." arXiv 2022. arXiv:2212.08073 (Ethical constraint enforcement)
Amodei, D., et al. "Concrete Problems in AI Safety." arXiv 2016. arXiv:1606.06565 (Safety problem classification)
Alchourrón, C., Gärdenfors, P., & Makinson, D. "On the Logic of Theory Change: Partial Meet Contraction and Revision Functions." Journal of Symbolic Logic, 50(2), 510–530, 1985. DOI:10.2307/2274239 (AGM belief revision - foundational for §5)
Cox, M.T. "Metacognition in Computation: A Selected Research Review." Artificial Intelligence, 169(2), 104–141, 2005. DOI:10.1016/j.artint.2005.10.009 (Triple-loop meta-cognition)
Wallach, W. & Allen, C. Moral Machines: Teaching Robots Right from Wrong. Oxford University Press, 2008. (Ethical kernel design)
Scherer, K.R. "Appraisal Considered as a Process of Multilevel Sequential Checking." In Appraisal Processes in Emotion, 92–120, Oxford University Press, 2001. (Affective engine theory)
Dehaene, S., et al. "Toward a Computational Theory of Conscious Processing." Current Opinion in Neurobiology, 15(2), 225–234, 2005. DOI:10.1016/j.conb.2005.03.009 (Consciousness and global workspace)
Picard, R.W. Affective Computing. MIT Press, 1997. (Emotion modeling in computational systems)
Shinn, N., et al. "Reflexion: Language Agents with Verbal Reinforcement Learning." NeurIPS 2023. arXiv:2303.11366 (Self-reflection in agents)
Russell, S. Human Compatible: Artificial Intelligence and the Problem of Control. Viking, 2019. (Value alignment and control)
Sloman, A. "Varieties of Meta-cognition in Natural and Artificial Systems." In Metareasoning: Thinking about Thinking, MIT Press, 2011. (Meta-cognitive architectures)

이전: ← 레벨 2: 자율 에이전트
다음: 레벨 4: 적응적 일반 에이전트 →