GenAIIDP Web User Interface

The solution includes a responsive web-based user interface built with React that provides comprehensive document management and monitoring capabilities.

Web UI Screenshot

The GenAIIDP Web Interface showing the document tracking dashboard with status information, classification results, and extracted data.

Features

Document tracking and monitoring capabilities
Real-time status updates of document processing
Secure authentication using Amazon Cognito
Searchable document history
Detailed document processing metrics and status information
Inspection of processing outputs for section classification and information extraction
Accuracy evaluation reports when baseline data is provided
View and edit pattern configuration, including document classes, prompt engineering, and model settings
Manage multiple configuration versions — create, compare, activate, and delete versions (see configuration-versions.md)
Confidence threshold configuration for HITL (Human-in-the-Loop) triggering through the Assessment & HITL Configuration section
Document upload from local computer
Knowledge base querying for document collections
“Chat with document” from the detailed view of the document
Document Process Flow visualization for detailed workflow execution monitoring and troubleshooting
Document Analytics for querying and visualizing processed document data

Edit Sections

The Edit Sections feature provides an intelligent interface for modifying document section classifications and page assignments, with automatic reprocessing optimization for Pattern-2 and Pattern-3 workflows.

Key Capabilities

Section Management: Create, update, and delete document sections with validation
Classification Updates: Change section document types with real-time validation
Page Reassignment: Move pages between sections with overlap detection
Intelligent Reprocessing: Only modified sections are reprocessed, preserving existing data
Immediate Feedback: Status updates appear instantly in the UI
Pattern Compatibility: Available for Pattern-2 and Pattern-3, with informative guidance for Pattern-1

How to Use

Navigate to a completed document’s detail page
In the “Document Sections” panel, click the “Edit Sections” button
For Pattern-2/Pattern-3: Enter edit mode with inline editing capabilities
For Pattern-1: View informative modal explaining BDA architecture differences

Editing Workflow (Pattern-2/Pattern-3)

Edit Section Classifications: Use dropdowns to change document types
Modify Page Assignments: Edit comma-separated page IDs (e.g., “1, 2, 3”)
Add New Sections: Click “Add Section” for new document boundaries
Delete Sections: Use remove buttons to delete unnecessary sections
Validation: Real-time validation prevents overlapping pages and invalid configurations
Submit Changes: Click “Save & Process Changes” to trigger selective reprocessing

Processing Optimization

The Edit Sections feature uses 2-phase schema knowledge optimization:

Phase 1: Frontend

Selective Payload: Only sends sections that actually changed
Validation Engine: Prevents invalid configurations before submission

Phase 2: Backend

Pipeline: Processing functions automatically skip redundant operations
- OCR: Skips if pages already have OCR data
- Classification: Skips if pages already classified
- Extraction: Skips if sections have extraction data
- Assessment: Skips if extraction results contain assessment data
Selective Reprocessing: Only modified sections lose their data and get reprocessed

Pattern Compatibility

Pattern-2 and Pattern-3 Support

Full Functionality: Complete edit capabilities with intelligent reprocessing
Performance Optimization: Automatic selective processing for efficiency
Data Preservation: Unmodified sections retain all processing results

Pattern-1 Support (Data-Only Edit Mode)

Pattern-1 uses Bedrock Data Automation (BDA) with automatic section management. Edit Mode in Pattern-1 provides data-only editing:

Data Editing: Edit extraction data (predictions and ground truth) via the “Edit Data” button for each section
Section Structure: Read-only - section boundaries, classifications, and page assignments are managed by BDA
Reprocessing: “Save and Reprocess” triggers evaluation and summarization steps without re-invoking BDA
BDA Skip Logic: When reprocessing with existing pages/sections data, BDA invocation is automatically skipped

Note: Pattern-2 and Pattern-3 offer full section structure editing. Pattern-1 maintains BDA-managed section boundaries while allowing extraction data modifications.

Edit Pages

The Edit Pages feature provides an intelligent interface for modifying individual page classifications and text content, with automatic selective reprocessing for Pattern-2 and Pattern-3 workflows.

Key Capabilities

View Page Text: Access clean, readable page text without JSON formatting in a modal editor
Classification Reset: Reset page classifications to force reclassification during reprocessing
Text Editing: Modify page OCR text with immediate S3 saves to prevent data loss
Confidence Editing: Edit OCR confidence data displayed as markdown tables
Split-Pane Editor: Side-by-side layout with text editor and live markdown preview
Intelligent Reprocessing: Only affected sections are reprocessed based on modification type
Pattern Compatibility: Available for Pattern-2 and Pattern-3, with informative guidance for Pattern-1

How to Use

Navigate to a completed document’s detail page
In the “Document Pages” panel, click the “Edit Pages” button
For Pattern-2/Pattern-3: Enter edit mode with page-level editing capabilities
For Pattern-1: View informative modal explaining BDA architecture differences

Editing Workflow (Pattern-2/Pattern-3)

View Mode (Default)

Click “View Page Text” button to view page content in read-only mode
Modal displays text with live markdown preview
Switch to “Text + Confidence” view to see OCR confidence table

Edit Mode

Click “Edit Pages”: Activates edit mode for all pages
Reset Page Classification (optional):
- Click the button next to page Class/Type
- Page becomes “Unclassified” and will be reclassified during reprocessing
Edit Page Text:
- Click “Edit Page Text” button to open modal editor
- Text + Markdown View: Edit plain text (left) with live markdown preview (right)
- Text + Confidence View: Edit markdown confidence table (left) with rendered preview (right)
- Click “Save” to write changes to S3 immediately
- Unsaved changes warning prevents data loss
Submit Changes: Click “Save & Process Changes” to trigger reprocessing
Review Impact: Confirmation modal shows how many pages will trigger reclassification vs re-extraction
Confirm: Click “Confirm & Process” to submit document for selective reprocessing

Processing Optimization

The Edit Pages feature uses intelligent selective reprocessing:

Page Classification Reset

Impact: Removes all sections containing the page
Triggers: Full reclassification and re-extraction
Use Case: When page was incorrectly classified

Page Text Modification

Impact: Clears extraction results for sections containing the page
Preserves: Sections and classifications remain intact
Triggers: Re-extraction only (skips OCR and classification)
Use Case: Correcting OCR errors to improve extraction accuracy

Backend Processing

OCR: Automatically skipped (page text already updated)
Classification: Skipped for text-only modifications, runs for class resets
Extraction: Runs for all affected sections
Assessment: Runs if extraction completes

Text Format Handling

Display: Plain text extracted from JSON wrapper - no raw JSON visible to users
Editing: User edits plain text in a clean Monaco editor
Storage: Text wrapped back in {"text": "..."} format for backward compatibility
Confidence: Markdown table format for readability (Text | Confidence columns)

Pattern Compatibility

Pattern-2 and Pattern-3 Support

Full Functionality: Complete page editing capabilities with intelligent reprocessing
Performance Optimization: Automatic selective processing based on modification type
Data Preservation: Unmodified pages and sections retain all processing results

Pattern-1 Information

Pattern-1 uses Bedrock Data Automation (BDA) with automatic page management. When Edit Pages is clicked, users see an informative modal explaining:

Architecture Differences: BDA handles page processing automatically
Alternative Workflows: Available options like “View Page Text”, Configuration updates, and document reprocessing
Future Considerations: Guidance on using Pattern-2/Pattern-3 for fine-grained page control

Prompt Preview

The Prompt Preview tab in the Configuration page allows you to see exactly what prompts are sent to the LLM for each processing step, with configuration-derived placeholders filled in using your actual class definitions and schemas.

Key Capabilities

Step Selection: Preview prompts for Classification, Extraction, Assessment, or Summarization
Class Selection: For Extraction and Assessment, select a specific document class to see how its JSON Schema appears in the prompt
Filled Placeholders: Config-derived values ({CLASS_NAMES_AND_DESCRIPTIONS}, {ATTRIBUTE_NAMES_AND_DESCRIPTIONS}, {DOCUMENT_CLASS}) are replaced with actual values from your configuration
Runtime Markers: Document-specific placeholders ({DOCUMENT_TEXT}, {DOCUMENT_IMAGE}, etc.) are shown as highlighted yellow markers indicating where document content will be inserted at processing time
Token Estimates: Approximate token counts for system and task prompts help with cost awareness
Copy to Clipboard: Copy rendered or raw prompt templates for use in external tools
Substitution Details: See the exact formatted class list or cleaned JSON Schema that gets inserted into the prompt

How to Use

Navigate to the Configuration page
Click the Prompt Preview tab (alongside Configuration, Document Schema, Rule Schema)
Select a Processing Step from the dropdown (Classification, Extraction, Assessment, Summarization)
For Extraction or Assessment, select a Document Class to preview its schema in the prompt
View the rendered prompts in the Task Prompt and System Prompt sub-tabs
Use the Raw Task Template and Raw System Template sub-tabs to see the unprocessed templates with placeholder syntax

Use Cases

Schema Optimization: See how your document class attributes appear in the extraction prompt — optimize descriptions and structure for better LLM comprehension
Prompt Debugging: Verify that custom prompt edits render correctly with actual placeholder values
Cost Awareness: Estimate prompt token usage before processing documents
Training: Help new users understand what the LLM actually sees during each processing step

Document Analytics

The Document Analytics feature allows users to query their processed documents using natural language and receive results in various formats including charts, tables, and text responses.

Key Capabilities

Natural Language Queries: Ask questions about your processed documents in plain English
Multiple Response Types: Results can be displayed as:
- Interactive charts and graphs (using Chart.js)
- Structured data tables with pagination and sorting
- Text-based responses and summaries
Real-time Processing: Query processing status updates with visual indicators
Query History: Track and review previous analytics queries

Technical Implementation Notes

The analytics feature uses a combination of real-time subscriptions and polling for status updates:

Primary Method: GraphQL subscriptions via AWS AppSync for immediate notifications when queries complete
Fallback Method: Polling every 5 seconds to ensure status updates are received even if subscriptions fail
Current Limitation: The AppSync subscription currently returns a Boolean completion status rather than full job details, requiring a separate query to fetch results when notified

TODO: Implement proper AppSync subscriptions that return complete AnalyticsJob objects to eliminate the need for additional queries and improve real-time user experience.

How to Use

Navigate to the “Document Analytics” section in the web UI
Enter your question in natural language (e.g., “How many documents were processed last week?”)
Click “Submit Query” to start processing
Monitor the status indicator as your query is processed
View results in the appropriate format (chart, table, or text)
Use the debug information toggle to inspect raw response data if needed

Document Process Flow Visualization

The Document Process Flow feature provides a visual representation of the Step Functions workflow execution for each document:

Document Process Flow

The Document Process Flow visualization showing the execution steps, status, and details.

Key Capabilities

Interactive Flow Diagram: Visual representation of the document processing workflow with color-coded status indicators
Step Details: Detailed information about each processing step including inputs, outputs, and execution time
Error Diagnostics: Clear visualization of failed steps with detailed error messages for troubleshooting
Timeline View: Chronological view of all processing steps with duration information
Auto-Refresh: Option to automatically refresh the flow data for active executions
Map State Support: Visualization of Map state iterations for parallel processing workflows

How to Use

Navigate to a document’s detail page
Click the “View Processing Flow” button in the document details header
The flow diagram will display all steps in the document’s processing workflow
Click on any step to view its detailed information including:
- Input/output data
- Execution duration
- Error messages (if applicable)
- Start and completion times
For active executions, toggle the auto-refresh option to monitor progress in real-time

Troubleshooting with Process Flow

The Document Process Flow visualization is particularly useful for troubleshooting processing issues:

Quickly identify which step in the workflow failed
View detailed error messages and stack traces
Understand the sequence of processing steps
Analyze execution times to identify performance bottlenecks
Inspect the input and output of each step to verify data transformation

Chat with Document

The “Chat with Document” feature is available at the bottom of the Document Detail view. This feature uses the same model that’s configured to do the summarization to provide a RAG interface to the document that’s the details are displayed for. No other document is taken in to account except the document you’re viewing the details of. Note that this feature will only work after the document status is marked as complete.

Your chat history will be saved as you continue your chat but if you leave the document details screen, your chat history is erased. This feature uses prompt caching for the document contents for repeated chat requests for each document.

See the feature in action in this video:

How to Use

Navigate to a document’s detail page and scroll to the bottom
In the text area, type in your question and you’ll see an answer pop up after the document is analyzed with the Nova Pro model

Authentication Features

The web UI uses Amazon Cognito for secure user authentication and authorization:

User Management

Admin users can be created during stack deployment
Optional self-service sign-up with email domain restrictions
Automatic email verification
Password policies and account recovery

Security Controls

Multi-factor authentication (MFA) support
Temporary credentials and automatic token refresh
Role-based access control using Cognito user groups
Secure session management

Deploying the Web UI

The web UI is automatically deployed as part of the CloudFormation stack. The deployment:

Creates required Cognito resources (User Pool, Identity Pool)
Builds and deploys the React application to S3
Sets up CloudFront distribution for content delivery (or ALB for VPC-based hosting — see ALB Hosting)
Configures necessary IAM roles and permissions

Accessing the Web UI

Once the stack is deployed:

Navigate to the ApplicationWebURL provided in the stack outputs
For first-time access:
- Use the admin email address specified during stack deployment
- Check your email for temporary credentials
- You will be prompted to change your password on first login

Running the UI Locally

To run the web UI locally for development:

Navigate to the /ui directory
Create a .env file using the WebUITestEnvFile output from the CloudFormation stack:

VITE_USER_POOL_ID=<value>
VITE_USER_POOL_CLIENT_ID=<value>
VITE_IDENTITY_POOL_ID=<value>
VITE_APPSYNC_GRAPHQL_URL=<value>
VITE_AWS_REGION=<value>
VITE_SETTINGS_PARAMETER=<value>

Install dependencies: npm install
Start the development server: npm run start
Open http://localhost:3000 in your browser

Configuration Options

The following parameters are configured during stack deployment:

AdminEmail: Email address for the admin user
AllowedSignUpEmailDomain: Optional comma-separated list of allowed email domains for self-service signup

Security Considerations

The web UI implementation includes several security features:

All communication is encrypted using HTTPS
Authentication tokens are automatically rotated
Session timeouts are enforced
CloudFront distribution uses secure configuration (or ALB with TLS 1.3 for VPC-based hosting)
S3 buckets are configured with appropriate security policies
API access is controlled through IAM and Cognito
Web Application Firewall (WAF) protection for AppSync API

Web Application Firewall (WAF)

The solution includes AWS WAF integration to protect your AppSync API:

IP-based access control: Restrict API access to specific IP ranges
Default behavior: By default (0.0.0.0/0), WAF is disabled and all IPs are allowed
Configuration: Use the WAFAllowedIPv4Ranges parameter to specify allowed IP ranges
- Example: "192.168.1.0/24,10.0.0.0/16" (comma-separated list of CIDR blocks)
Security benefit: When properly configured, WAF blocks all traffic except from your trusted IP ranges and AWS Lambda service IP ranges
Lambda service access: The solution automatically maintains a WAF IPSet with current AWS Lambda service IP ranges to ensure Lambda functions can always access the AppSync API even when IP restrictions are enabled

When configuring the WAF:

IP ranges must be in valid CIDR notation (e.g., 192.168.1.0/24)
Multiple ranges should be comma-separated
The WAF is only enabled when the parameter is set to something other than the default 0.0.0.0/0
Lambda functions within your account will automatically have access to the AppSync API regardless of IP restrictions

Monitoring and Troubleshooting

The web UI includes built-in monitoring:

CloudWatch metrics for API and authentication activity
Access logs in CloudWatch Logs
CloudFront distribution logs (or ALB access logs for VPC-based hosting)
Error tracking and reporting
Performance monitoring

To troubleshoot issues:

Check CloudWatch Logs for application errors
Verify Cognito user status in the AWS Console
Check CloudFront distribution status (or ALB target group health for VPC-based hosting)
Verify API endpoints are accessible
Review browser console for client-side errors