Pattern: Just-In-Time Generation (JITG)

pattern generation procedural performance

Context

Generating an entire game world upfront is expensive (time, tokens, memory) and wasteful—most content will never be encountered. Just-In-Time Generation defers content creation until the moment it’s needed, then caches the result for consistency.

Use this pattern when:

Building procedurally generated worlds
Token budgets are limited
Players explore non-linearly
World scale is large or unbounded
Generation latency is acceptable (1-5 seconds)

Forces

Competing concerns:

Upfront Cost vs Runtime Latency
- Full pre-generation: expensive upfront, instant access
- JIT: cheap startup, generates on-demand (adds latency)
Consistency vs Flexibility
- Pre-generated content is fixed and consistent
- JIT risks inconsistency if not cached properly
Detail Level vs Performance
- Deep generation (full NPC backstories) is slow
- Shallow generation (name only) is fast but limited
Context Requirements
- JIT needs parent context to generate children
- Must backpropagate up hierarchy to establish context

Solution

Structure

flowchart TD
    A[Player enters new location] --> B{Already generated?}
    B -->|Yes| C[Load from cache]
    B -->|No| D[Determine parent context]
    D --> E[Generate with LLM]
    E --> F[Validate output]
    F --> G[Cache to database]
    G --> H[Return to player]
    C --> H

    style D fill:#4a90e2
    style E fill:#e27d60
    style G fill:#50c878

Generation Strategies

1. Stub-Based JIT

Generate minimal “stubs” upfront, fill details on-demand:

# Initial stub (instant)
location = {
    "id": "tavern_001",
    "type": "tavern",
    "parent": "city_stonevale",
    "name": None,  # Generate when needed
    "description": None,  # Generate when needed
    "npcs": [],  # Generate when needed
}
 
# Full generation (when player enters)
def hydrate_location(stub):
    if not stub["name"]:
        stub["name"] = generate_name(stub["type"], stub["parent"])
    if not stub["description"]:
        stub["description"] = generate_description(stub)
    if not stub["npcs"]:
        stub["npcs"] = generate_npcs(stub, count=random.randint(2, 5))
    return stub

2. Hierarchical Backpropagation

When generating a location, ensure parent context exists:

def generate_location(location_id):
    """Generate location and ensure parent context exists"""
    location = db.get(location_id)
 
    # Backpropagate up the tree
    parent_chain = []
    current = location
    while current["parent"]:
        parent = db.get(current["parent"])
        if not parent["generated"]:
            parent_chain.append(parent)
        current = parent
 
    # Generate from top-down
    for parent in reversed(parent_chain):
        generate_and_cache(parent)
 
    # Finally generate target
    return generate_and_cache(location)

3. Lazy NPC Generation

NPCs don’t need full details until player interacts:

class NPC:
    def __init__(self, npc_id, location):
        self.id = npc_id
        self.location = location
        self.name = None  # Generated on first mention
        self.personality = None  # Generated on first conversation
        self.inventory = None  # Generated on first trade
        self.backstory = None  # Generated on deeper interaction
 
    def get_name(self):
        """Lazy name generation"""
        if not self.name:
            self.name = llm_generate_name(self.location, self.id)
            db.save_npc(self)
        return self.name
 
    def get_personality(self):
        """Lazy personality generation"""
        if not self.personality:
            self.personality = llm_generate_personality(
                name=self.name,
                location=self.location
            )
            db.save_npc(self)
        return self.personality

Implementation

Complete JITG System

from typing import Dict, Any, Optional, List
from dataclasses import dataclass, field
import hashlib
import json
 
@dataclass
class GeneratedEntity:
    """Cached generated content"""
    id: str
    type: str
    parent_id: Optional[str]
    data: Dict[str, Any]
    generated: bool = False
    children: List[str] = field(default_factory=list)
 
class JITGenerator:
    """Just-In-Time content generation with caching"""
 
    def __init__(self, llm_client, cache_db):
        self.llm = llm_client
        self.cache = cache_db
 
    def get_or_generate(self, entity_id: str, entity_type: str,
                       parent_id: Optional[str] = None) -> GeneratedEntity:
        """
        Get entity from cache or generate if needed.
        Ensures parent context exists before generation.
        """
        # Check cache first
        cached = self.cache.get(entity_id)
        if cached and cached.generated:
            return cached
 
        # Ensure parent context exists
        parent_context = None
        if parent_id:
            parent_context = self.get_or_generate(
                parent_id,
                self._infer_parent_type(entity_type),
                None
            )
 
        # Generate new entity
        entity = self._generate(entity_id, entity_type, parent_context)
 
        # Cache result
        self.cache.save(entity)
 
        return entity
 
    def _generate(self, entity_id: str, entity_type: str,
                  parent: Optional[GeneratedEntity]) -> GeneratedEntity:
        """Generate entity with LLM"""
        prompt = self._build_generation_prompt(entity_type, parent)
 
        response = self.llm.complete(
            prompt=prompt,
            temperature=0.8,
            max_tokens=300
        )
 
        data = self._parse_generation(response, entity_type)
 
        entity = GeneratedEntity(
            id=entity_id,
            type=entity_type,
            parent_id=parent.id if parent else None,
            data=data,
            generated=True
        )
 
        # Link to parent
        if parent:
            parent.children.append(entity_id)
            self.cache.save(parent)
 
        return entity
 
    def _build_generation_prompt(self, entity_type: str,
                                 parent: Optional[GeneratedEntity]) -> str:
        """Build context-aware generation prompt"""
        if entity_type == "location":
            return self._build_location_prompt(parent)
        elif entity_type == "npc":
            return self._build_npc_prompt(parent)
        elif entity_type == "item":
            return self._build_item_prompt(parent)
        else:
            raise ValueError(f"Unknown entity type: {entity_type}")
 
    def _build_location_prompt(self, parent: Optional[GeneratedEntity]) -> str:
        """Generate location within parent context"""
        if not parent:
            # Root location
            return """Generate a fantasy location:
- Name
- Type (city/tavern/dungeon/wilderness)
- Brief description (2-3 sentences)
- Atmosphere/mood
 
Format as JSON."""
 
        # Child location
        parent_data = parent.data
        return f"""Generate a location within {parent_data['name']}:
 
Parent Location: {parent_data['name']}
Parent Type: {parent.type}
Parent Description: {parent_data['description']}
 
Create a new location that fits this context:
- Name
- Type
- Description (2-3 sentences)
- How it connects to parent
 
Format as JSON."""
 
    def _build_npc_prompt(self, parent: Optional[GeneratedEntity]) -> str:
        """Generate NPC for location"""
        if not parent:
            raise ValueError("NPCs require parent location")
 
        location_data = parent.data
        return f"""Generate an NPC for {location_data['name']}:
 
Location: {location_data['name']} ({parent.type})
Atmosphere: {location_data.get('atmosphere', 'neutral')}
 
Create an NPC:
- Name
- Role/occupation
- Brief personality (1-2 traits)
- Reason they're here
 
Format as JSON."""
 
    def _build_item_prompt(self, parent: Optional[GeneratedEntity]) -> str:
        """Generate item for location/NPC"""
        context = parent.data if parent else {}
        return f"""Generate an item:
 
Context: {context.get('name', 'unknown location')}
 
Create an item:
- Name
- Type (weapon/armor/consumable/misc)
- Description
- Special properties (if any)
 
Format as JSON."""
 
    def _parse_generation(self, response: str, entity_type: str) -> Dict[str, Any]:
        """Parse LLM output into structured data"""
        try:
            # Try JSON parsing first
            return json.loads(response)
        except json.JSONDecodeError:
            # Fallback: extract with regex or use defaults
            return {
                "name": "Generated " + entity_type,
                "description": response[:200],
                "raw_output": response
            }
 
    def _infer_parent_type(self, child_type: str) -> str:
        """Infer parent type from child type"""
        hierarchy = {
            "room": "building",
            "building": "city",
            "city": "region",
            "region": "world",
            "npc": "location",
            "item": "location"
        }
        return hierarchy.get(child_type, "location")
 
    def generate_neighbors(self, entity_id: str, count: int = 3) -> List[GeneratedEntity]:
        """Generate neighboring entities (siblings)"""
        entity = self.cache.get(entity_id)
        if not entity:
            raise ValueError(f"Entity {entity_id} not found")
 
        neighbors = []
        for i in range(count):
            neighbor_id = f"{entity.parent_id}_sibling_{i}"
            if not self.cache.exists(neighbor_id):
                neighbor = self.get_or_generate(
                    neighbor_id,
                    entity.type,
                    entity.parent_id
                )
                neighbors.append(neighbor)
 
        return neighbors
 
 
# Cache implementation
class GenerationCache:
    """Simple cache for generated entities"""
 
    def __init__(self, db_path: str):
        self.db_path = db_path
        self._cache = {}
 
    def get(self, entity_id: str) -> Optional[GeneratedEntity]:
        """Get from memory cache or load from DB"""
        if entity_id in self._cache:
            return self._cache[entity_id]
 
        # Load from DB
        entity = self._load_from_db(entity_id)
        if entity:
            self._cache[entity_id] = entity
        return entity
 
    def save(self, entity: GeneratedEntity):
        """Save to memory and DB"""
        self._cache[entity.id] = entity
        self._save_to_db(entity)
 
    def exists(self, entity_id: str) -> bool:
        """Check if entity exists"""
        return entity_id in self._cache or self._exists_in_db(entity_id)
 
    def _load_from_db(self, entity_id: str) -> Optional[GeneratedEntity]:
        """Load from database"""
        # Implementation depends on DB choice
        # For simplicity, using JSON files
        import os
        path = f"{self.db_path}/{entity_id}.json"
        if os.path.exists(path):
            with open(path, 'r') as f:
                data = json.load(f)
                return GeneratedEntity(**data)
        return None
 
    def _save_to_db(self, entity: GeneratedEntity):
        """Save to database"""
        import os
        os.makedirs(self.db_path, exist_ok=True)
        path = f"{self.db_path}/{entity.id}.json"
        with open(path, 'w') as f:
            json.dump({
                "id": entity.id,
                "type": entity.type,
                "parent_id": entity.parent_id,
                "data": entity.data,
                "generated": entity.generated,
                "children": entity.children
            }, f, indent=2)
 
    def _exists_in_db(self, entity_id: str) -> bool:
        """Check if exists in DB"""
        import os
        return os.path.exists(f"{self.db_path}/{entity_id}.json")
 
 
# Usage Example
if __name__ == "__main__":
    # Mock LLM client
    class MockLLM:
        def complete(self, prompt, temperature, max_tokens):
            # Simulate LLM generation
            return '''
            {
                "name": "The Rusty Flagon",
                "type": "tavern",
                "description": "A dimly lit tavern with worn wooden tables and a crackling fireplace.",
                "atmosphere": "cozy but suspicious"
            }
            '''
 
    # Initialize
    cache = GenerationCache("./generated_cache")
    generator = JITGenerator(MockLLM(), cache)
 
    # Generate city (parent)
    city = generator.get_or_generate("city_001", "city", None)
    print(f"Generated city: {city.data.get('name', 'Unknown')}")
 
    # Generate tavern in city (JIT)
    tavern = generator.get_or_generate("tavern_001", "location", "city_001")
    print(f"Generated tavern: {tavern.data.get('name', 'Unknown')}")
 
    # Generate NPC in tavern (JIT)
    npc = generator.get_or_generate("npc_001", "npc", "tavern_001")
    print(f"Generated NPC: {npc.data.get('name', 'Unknown')}")
 
    # Second request for tavern (cached, no LLM call)
    tavern_cached = generator.get_or_generate("tavern_001", "location", "city_001")
    print(f"Cached tavern: {tavern_cached.data.get('name', 'Unknown')}")

Consequences

Benefits

Reduced Startup Time: Game starts immediately, no waiting for world generation
Token Efficiency: Only pay for content actually encountered
Scalability: World can be arbitrarily large
Memory Efficiency: Only active content loaded in memory
Flexibility: Can adjust generation quality based on player progression

Liabilities

Generation Latency: Players wait 1-5 seconds when entering new areas
Context Requirements: Must maintain parent hierarchy for coherent generation
Consistency Risk: Without caching, same entity could generate differently
Complexity: More complex than full pre-generation
Cache Management: Need to handle cache invalidation and updates

Performance Characteristics

Upfront cost:

Full generation: O(n) where n = all possible locations
JIT: O(1) - instant startup

Runtime cost:

Full generation: O(1) - instant access
JIT: O(g + c) where g = generation time, c = cache lookup

Memory:

Full generation: O(n) - entire world in memory
JIT: O(a) where a = active/visited locations

Optimization Strategies

Predictive Generation: Generate likely next locations during idle time
Layered Detail: Generate basic details immediately, enrich later
Batch Generation: Generate multiple related entities in one LLM call
Streaming Generation: Start displaying partial results while generating
Background Workers: Offload generation to background threads/processes

Hierarchical Cascade - Determines generation order
Scene-Based State - Triggers for JIT generation
Program-First Architecture - Where JIT fits
Template Meta-Generation - Alternative approach

Source

Original Discussions:

January 2024: Initial JIT concept
February 2024: veritasr’s implementation in ReallmCraft
Contributors: User-veritasr, User-50h100a, User-ycros, User-monkeyrithms

Key Quotes:

“probably need some sort of JIT generation.” - veritasr

“JIT gen everywhere possible… crowds are full of nameless people until the player stops and asks one what their name is” - 50h100a

“if you take a procgen approach, you can JIT the details without a bunch of effort. Like rimworld for example” - veritasr

“in the case of the tavern example above, the children represent important / rooms of note. Sort of working in a JITG (Just In Time Generation) mindset, where things don’t need to exist for the most part until they’re needed. But once they’re needed they need to be generated with fixed values, so rules can be applied to them.” - veritasr

“basically limitless potential until it exists as a fact. Once it becomes a fact, though it needs to be set in stone.” - veritasr

Referenced in:

Architecture and Design Thread

Implementation Notes

When to Generate

Immediate generation:

Player enters new location
Player asks about specific NPC
Quest requires specific item/location
Combat encounter needs enemy stats

Deferred generation:

Distant locations (not adjacent)
Background NPCs (until player interacts)
Item details (until player examines)
Full backstories (until player investigates)

Cache Invalidation

class CachePolicy:
    """Determine what can be regenerated"""
 
    @staticmethod
    def can_regenerate(entity: GeneratedEntity) -> bool:
        """Determine if entity can be regenerated"""
        # Never regenerate if player has interacted
        if entity.data.get("player_visited"):
            return False
 
        # Can regenerate if only referenced, not visited
        if entity.data.get("mentioned_only"):
            return True
 
        # Time-based: regenerate if not visited in 24h game time
        if entity.data.get("last_visit_time"):
            hours_since = get_game_hours_since(entity.data["last_visit_time"])
            return hours_since > 24
 
        return False

Error Handling

def safe_generate(generator, entity_id, entity_type, parent_id, max_retries=3):
    """Generate with fallback"""
    for attempt in range(max_retries):
        try:
            return generator.get_or_generate(entity_id, entity_type, parent_id)
        except Exception as e:
            if attempt == max_retries - 1:
                # Use template fallback
                return create_template_entity(entity_type)
            time.sleep(1)  # Brief delay before retry

Implementation in ChatBotRPG

Status: ✅ EXACT MATCH - Character and location generation are purely on-demand

Source Files:

src/generate/generate_actor.py - JIT character generation
src/generate/generate_setting.py - JIT location generation
src/editor_panel/actor_manager.py - Generation UI triggers

Production Example: On-Demand Character Generation

File: src/generate/generate_actor.py

ChatBotRPG generates characters only when user clicks “Generate” button in the actor editor:

def generate_actor_fields_async(actor_name, location, genre, fields_to_generate):
    """
    Called only when user explicitly requests generation.
    NO pre-generation at world creation.
    """
 
    if 'name' in fields_to_generate:
        generated_name = _generate_character_name(genre)
 
    if 'description' in fields_to_generate:
        generated_desc = _generate_character_description(name, genre)
 
    if 'personality' in fields_to_generate:
        generated_personality = _generate_personality(name, description, genre)
 
    # Only fields requested are generated

Why JIT: Saves LLM calls for unused characters - most NPCs in template library are never instantiated in actual gameplay.

Production Example: On-Demand Location Generation

File: src/generate/generate_setting.py

def generate_setting(world_data, region_name, parent_setting):
    """
    Called only when user clicks "Generate" button in setting editor.
    Location created only when needed.
    """
 
    prompt = f"Generate a new location in {parent_setting} within {region_name}..."
    generated_setting = make_inference(
        context=[{"role": "user", "content": prompt}],
        url_type=get_default_utility_model(),
        temperature=0.7
    )
 
    # Location not pre-generated, only when user requests

Two-Tier Generation Model

ChatBotRPG implements a two-tier approach:

Tier 1: Templates (World-Level)

Location: workflow_data_dir/resources/data files/
Contains: Reusable actor/setting/item templates
Scope: Cross-playthrough, immutable
Generation: JIT when Game Master designs world

Tier 2: Instances (Playthrough-Level)

Location: workflow_data_dir/game/
Contains: Active playthrough state (actors, settings, conversation)
Scope: Single playthrough, mutable
Generation: Copied from templates when first encountered

File References: src/core/character_inference.py (lines 117-152)

resources_dir = os.path.join(workflow_data_dir, 'resources', 'data files')
actors_dir = os.path.join(resources_dir, 'actors')  # Templates (JIT created by GM)
 
game_dir = os.path.join(workflow_data_dir, 'game')
session_actors_dir = os.path.join(game_dir, 'actors')  # Instances (JIT copied on first use)

Caching Strategy

From: src/core/character_inference.py (lines 1330-1351)

Once a character is instantiated in gameplay, ChatBotRPG creates a session actor file:

def _copy_resource_actor_to_game_if_not_exist(self, workflow_data_dir, character_name):
    """
    JIT instantiation: Copy template to game directory on first encounter.
    Subsequent references use cached session file.
    """
    game_actors_dir = os.path.join(workflow_data_dir, 'game', 'actors')
    session_actor_path = os.path.join(game_actors_dir, f"{character_name}.json")
 
    if os.path.exists(session_actor_path):
        return  # Already instantiated, use cache
 
    # First encounter - copy from template
    resource_actor_path = _find_actor_file_path(workflow_data_dir, character_name)
    if resource_actor_path:
        shutil.copy(resource_actor_path, session_actor_path)

Benefit: Template remains immutable, session instance can evolve (relationships, variables, memories).

Performance Metrics

Token Costs (from production analysis):

Character name: 256 tokens
Full character (7 fields): ~1,500 tokens total
Location: ~400 tokens

Latency: 1-5 seconds per generation (acceptable for editor workflow)

Cost Savings: If world has 100 NPCs but player only encounters 10 → 90% savings

When Generation Happens

Character Generation Triggers:

Game Master clicks “Generate” in actor editor
User explicitly requests character creation

Character Instantiation Triggers:

Character first mentioned in conversation
Character enters scene (timer-based or rule-based)

No Predictive Generation: ChatBotRPG does NOT pre-generate likely encounters - pure on-demand model.

Pattern-to-Code Mapping - JIT validation (lines 622-667)
Discord Claims Validation - JIT confirmed (lines 199-217)

LLM World Engine Knowledge Base

Explorer

jit-generation

Pattern: Just-In-Time Generation (JITG)

Context

Forces

Solution

Structure

Generation Strategies

1. Stub-Based JIT

2. Hierarchical Backpropagation

3. Lazy NPC Generation

Implementation

Complete JITG System

Consequences

Benefits

Liabilities

Performance Characteristics

Optimization Strategies

Source

Implementation Notes

When to Generate

Cache Invalidation

Error Handling

Implementation in ChatBotRPG

Production Example: On-Demand Character Generation

Production Example: On-Demand Location Generation

Two-Tier Generation Model

Caching Strategy

Performance Metrics

When Generation Happens

Tags

Graph View

Table of Contents

Backlinks

LLM World Engine Knowledge Base

Explorer

jit-generation

Pattern: Just-In-Time Generation (JITG)

Context

Forces

Solution

Structure

Generation Strategies

1. Stub-Based JIT

2. Hierarchical Backpropagation

3. Lazy NPC Generation

Implementation

Complete JITG System

Consequences

Benefits

Liabilities

Performance Characteristics

Optimization Strategies

Related Patterns

Source

Implementation Notes

When to Generate

Cache Invalidation

Error Handling

Implementation in ChatBotRPG

Production Example: On-Demand Character Generation

Production Example: On-Demand Location Generation

Two-Tier Generation Model

Caching Strategy

Performance Metrics

When Generation Happens

Related Implementation Files

Tags

Graph View

Table of Contents

Backlinks