Portfolio
Case Studies
RAG Chatbot with Single Webpage
Research
Category: AI Experimentation
Technologies Used: OpenAI, Pinecone, LangChain
Concept
I wanted to be able to provide an AI Knowledge Base for a Chatbot that could be easily implemented and prototyped. This was so that I could make a baseline implementation to enhance with client needs and apply to their specific use-case.
Process
graph LR
subgraph Data Sources
D1[Raw Website Data] -->|Process| D2[Filtered & Chunked Data]
D2 -->|Embed| D3[Vector Embeddings]
end
subgraph Storage
D3 -->|Store| VDB[(Pinecone Vector DB)]
end
subgraph Query Flow
Q1[User Question] -->|Query| VDB
VDB -->|Context| Q2[Enhanced Question]
Q2 -->|Process| Q3[GPT-4 Response]
end
style D1 fill:#f9f9f9,stroke:#489FB5,stroke-width:2px
style D2 fill:#f9f9f9,stroke:#489FB5,stroke-width:2px
style D3 fill:#f9f9f9,stroke:#489FB5,stroke-width:2px
style VDB fill:#f9f9f9,stroke:#489FB5,stroke-width:2px
style Q1 fill:#f9f9f9,stroke:#489FB5,stroke-width:2px
style Q2 fill:#f9f9f9,stroke:#489FB5,stroke-width:2px
style Q3 fill:#f9f9f9,stroke:#489FB5,stroke-width:2px
I took the idea that I had and templated the workflow using FlowiseAI. The no-code UI allowed me to experiment with how I wanted to source the information and make connections with the embeddings from OpenAI to the Pinecone vector DB. Using the VectorDB, I enhanced questions that were asked of the chatbot using the gpt-4o-mini
model to give contextual information.
Outcomes
- I made a github repository with my base implementation that I can now showcase to people what high level prototyping can lead to
- I learned about the efficacy that sanitizing the data and chunking it appropriately can have on the cost and token utilization when leveraging LLM Models
Product Development
Notification System Optimizations
Details
Category: Software Engineering
Technologies Used: Spring Boot, Java, Apache Kafka, Microsoft SQL Server, Kubernetes
Problem Statement
The existing notification system was complex, obfuscated, and inconsistent. We needed a framework to standardize the creation of new notification types and their default behaviors within our system.
Previous Architecture
sequenceDiagram
participant Timer
participant Scheduler
participant DB
participant UserService
participant NotificationService
Note over Timer: Every 5 minutes
Timer->>Scheduler: Trigger notification check
Scheduler->>DB: Query events from last 5 minutes
DB-->>Scheduler: Return recent events
Scheduler->>UserService: Get all users
UserService-->>Scheduler: Return user list
loop For each event
Scheduler->>UserService: Filter users by profile
UserService-->>Scheduler: Return matching users
loop For each matching user
Scheduler->>NotificationService: Send notification
NotificationService-->>Scheduler: Notification sent
end
end
Issues
The main issue with this implementation was not that it was inherently complex, but that we had so many notifications and this single service was bloated with a lot of timers that couldn't work for the company standards long term. The amount of code became un-maintainable and features were easily lost on what was being used and what wasn't being used. Finally, if there was any change to one notification setup, it would affect all other notifications within the product.
Implementation
I had created a new Spring Boot application that leveraged events streamed through Apache Kafka to then get user notification profiles and send a notification to them based on their existing preferences.
New Architecture
sequenceDiagram
participant EventSource
participant Kafka
participant NotificationProcessor
participant UserService
participant NotificationService
EventSource->>Kafka: Publish event with user data
Kafka->>NotificationProcessor: Stream event
NotificationProcessor->>UserService: Get user notification profile
UserService-->>NotificationProcessor: Return profile preferences
NotificationProcessor->>NotificationService: Send notification based on profile
NotificationService-->>NotificationProcessor: Notification sent
Note over EventSource,Kafka: Event-driven architecture
Note over NotificationProcessor: Single responsibility per processor
Value Provided
The new architecture enabled easy expansion of notification types and eliminated duplicates through Apache Camel routing and the Idempotent Consumer pattern. This provided developers with clear documentation of upstream event sources and their relationships with our notification system.
- I was able to reduce the existing complexities within the notification system by making a template for new notifications to be built upon.
- I reduced the number of unused notifications in the system from a legacy application.
- Removed the reliance on a legacy application which wasn't on the current deployment standards.