AI Engineer Case Study

ELMO Project

Revolutionizing News Consumption Through Advanced AI-Powered Content Generation & Intelligent Aggregation

Executive Summary

As the AI Engineer for ELMO, I architected and implemented an intelligent news aggregation system that fundamentally transformed information consumption patterns. By seamlessly integrating cutting-edge language models with real-time data retrieval mechanisms, I developed a sophisticated AI pipeline capable of generating contextually relevant, non-redundant news content at scale.

Role

AI Engineer

Duration

Jan - May 2025

Team

5 members

Institution

UT Dallas

Performance

32% faster than target response time

Accuracy

94% factual accuracy in generated content

User Impact

78% improvement in comprehension scores

Efficiency

23 minutes saved per user session

Performance Metrics

Quantifiable results demonstrating the impact and effectiveness of the AI engineering solutions

85%

Content Redundancy Reduction

Target: 70%

94%

Factual Accuracy Rate

Target: 90%

3.4s

Response Time

Target: 5s

98%

English Consistency

Target: 95%

Technical Implementation

AI Architecture

Integrated DeepSeek R1:14b with custom deployment and enhancement pipeline

┌─────────────────┐    ┌──────────────────┐    ┌─────────────────┐
│   DeepSeek R1   │    │   Ollama API     │    │  Custom RAG     │
│   (14B params)  │◄──►│   (Local Host)   │◄──►│   Framework     │
└─────────────────┘    └──────────────────┘    └─────────────────┘
         ▲                        ▲                       ▲
         │                        │                       │
         ▼                        ▼                       ▼
┌─────────────────┐    ┌──────────────────┐    ┌─────────────────┐
│  Google Search  │    │   NewsAPI        │    │   Pinecone      │
│      API        │    │   Integration    │    │ Vector Store    │
└─────────────────┘    └──────────────────┘    └─────────────────┘

Key Achievements

Reduced hallucination rate from 23% to <5%
Achieved 98% English-only response rate
Maintained 94% factual accuracy

Performance Impact

99.2% API uptime during development
80% reduction in local resource requirements
Seamless team collaboration enabled

Custom RAG Framework

Built sophisticated Retrieval Augmented Generation combining real-time news data with AI generation

Multi-source Retrieval

NewsAPI + Google Search with intelligent filtering and deduplication

Content Deduplication

Semantic similarity analysis to eliminate redundancy across sources

Dynamic Adaptation

Adjusts output complexity based on user preferences and content type

Technical Challenges & Solutions

Key obstacles overcome during development and innovative solutions implemented

1
Model Hallucination & Language Consistency

2
Real-time Information Integration

3
Content Quality vs. Speed Trade-off

Technical Innovation

Novel approaches and frameworks developed to solve complex AI engineering challenges

Hybrid RAG Architecture

Semantic retrieval using vector embeddings for contextual relevance
Temporal retrieval prioritizing recent content and breaking news
Source-diverse retrieval ensuring perspective variety and bias mitigation

Source Transparency Framework

Comprehensive source tracking and attribution with hyperlink preservation
Credibility scoring and bias analysis integration
Fact-check history integration and verification workflows

Technologies Mastered

Cutting-edge AI and cloud technologies leveraged to build a scalable, intelligent news platform

DeepSeek R1:14b

Ollama API

RAG Architecture

Pinecone Vector DB

AWS Lambda

Prompt Engineering

NewsAPI

Google Search API

Let's Connect

Interested in discussing AI engineering, machine learning innovations, or potential collaboration opportunities? I'd love to connect and explore how we can build the future together.