Jay Bennett

Senior Data Engineer | Minneapolis, MN

AI-Augmented Data Engineering

Real projects where AI delivered measurable business impact

🎯 Featured Project: Tennessee CCWIS Automation

Client: Tennessee Department of Children's Services (via Deloitte)

Duration: 5 months (2024)

Result: 70% reduction in manual data mapping effort

The Challenge

Tennessee was migrating their Child Welfare system (CCWIS) from a legacy platform to Salesforce. The data warehouse team needed to:

Manual process: Analysts spending 4-6 hours per report reading docs, cross-referencing metadata, writing specifications in spreadsheets.

Volume: 100+ reports across Child Care, Independent Living, and OCS domains.

Timeline pressure: State government deadlines, multi-million dollar project.

The Opportunity

I noticed the team doing repetitive pattern-matching work:

Insight: This is exactly what LLMs are good at - reading structured documents, finding patterns, generating structured output.

The Solution: Python + ChatGPT Automation

Architecture

Built a Python pipeline that:

  1. Ingested Salesforce metadata - Parsed XML/JSON files into structured data
  2. Extracted report requirements - Read business requirement docs, pulled relevant sections
  3. Generated AI prompts - Created context-rich prompts with metadata + requirements
  4. Called ChatGPT API - Sent prompts, received structured mappings as JSON
  5. Validated output - Checked for completeness, flagged ambiguities
  6. Generated Excel specs - Formatted into analyst-friendly spreadsheets

Prompting Strategy

The key was giving ChatGPT the right context:

Prompt Structure (High-Level):

You are a data warehouse architect analyzing Salesforce metadata.

CONTEXT:
- Report Name: [Report Title]
- Business Purpose: [From requirements doc]
- Salesforce Objects: [List of tables involved]

METADATA:
[Paste relevant Salesforce field definitions]

TASK:
Generate technical mapping specification:
1. Identify primary fact table
2. List dimension tables
3. Map each report column to Snowflake table.column
4. Classify field types (measure vs attribute)
5. Note any ambiguities or data quality concerns

OUTPUT FORMAT: JSON
      

What made it work:

Implementation Details

Technology Stack

Workflow Integration

Human-in-the-loop design: AI generated 80% of mapping, analysts reviewed and refined.

  1. Python script generated draft specifications
  2. Analyst reviewed for accuracy (15-30 minutes vs 4-6 hours)
  3. Analyst added business context AI couldn't infer
  4. Final spec approved by lead architect

This preserved data quality while dramatically reducing time investment.

Results & Impact

Quantitative Results

Business Impact

Technical Learning

Key Takeaways

When to Use AI for Automation

✅ Good candidates:

❌ Poor candidates:

My AI Development Approach

  1. Start small: Automate one report as proof-of-concept
  2. Measure results: Compare AI output vs manual work
  3. Iterate prompts: Refine based on what AI gets wrong
  4. Add validation: Build checks for common failure modes
  5. Human-in-loop: Design for review, not full automation
  6. Document everything: Prompts, examples, edge cases

Other AI-Augmented Projects

xBrezzo CaseBuilder (2025)

Built family law case management system using Lovable AI platform:

Daily Development Work

I use AI tools constantly for routine tasks:

Estimated productivity gain: 10x on routine tasks, 3-5x on complex problems

Want to Learn More?

I'm happy to discuss AI-augmented development approaches, share lessons learned, or collaborate on similar automation opportunities.

Contact me | View full experience