Skip to main content

Who Am I?

ML Engineer & Data Engineer

The Journey

From a Gold Medal engineering graduate to a global tech innovator - my journey spans across continents and industries. Currently pursuing M.Sc. International Information Systems in Germany while revolutionizing e-commerce at PUMA GROUP.

The Expertise

I develop enterprise-scale solutions and am passionate about exploring intelligent AI systems that can transform business processes. Currently building AI-powered analytics platforms that deliver operational efficiency gains while exploring agentic AI and process analytics as future career paths. I specialize in end-to-end data engineering pipelines, machine learning model deployment using Vertex AI, and scalable cloud architectures with modern tools like dbt, Airflow, and BigQuery. Aspiring to advance into ML Engineering and Data Engineering roles, I'm building expertise in autonomous CI/CD pipelines with Terraform, microservices architectures, and intelligent process optimization - turning complex technical challenges into elegant, performance-driven solutions that scale globally.

The Impact

My solutions deliver measurable business value, and I'm driven by the potential of intelligent automation. With hands-on experience coordinating global OMS operations across 4+ regions and integrating enterprise platforms, I'm building expertise that bridges traditional software engineering with cutting-edge ML and data engineering practices. Through hands-on experience with GCP, Terraform, Docker, and CI/CD pipelines, along with automated testing frameworks, SSO enablement, and enterprise API integrations, I'm developing the skills needed to advance into roles where I can architect intelligent systems that transform entire business processes. My goal is to pioneer the next generation of agentic AI solutions across Banking, E-commerce, Sports & Lifestyle, and Telecommunications sectors.

Tech Arsenal

Languages

Python Java C# C++ JavaScript TypeScript SQL

Frameworks

React Vue.js Next.js Flask Quarkus Spring Boot

Cloud & AI

Google Cloud Vertex AI Agentic AI GenAI BigQuery Azure Terraform

Data & Analytics

Power BI Google Looker Pandas NumPy Scikit-learn Apache Spark Process Analytics

Data Engineering

Apache Kafka dbt Apache Airflow ETL Pipelines Data Warehousing MongoDB PostgreSQL Redis

DevOps & CI/CD

Terraform CI/CD Pipelines Docker Kubernetes GCP DevOps Automated Testing

Academic Excellence

Gold Medallist

Global Experience

5 Companies3+ Years

Performance Impact

4+ RegionsOMS Unified

Future Focus

Agentic AIProcess Analytics

Solving Enterprise Data Challenges

Unified Data Collection

Consolidating fragmented data streams into a single source of truth

Kafka GCS APIs

Scalable Analytics

Enabling complex queries on petabytes of data in seconds

BigQuery SQL

Data Quality & Consistency

Ensuring reliable, tested data models across the organization

dbt Jinja

Intelligent Automation

Self-healing pipelines that scale with business growth

Airflow DAGs

Predictive Intelligence

Turning data into business foresight with production ML

Vertex AI MLOps

Zero-Downtime Delivery

Ensuring 24/7 availability with automated rollback & monitoring

Terraform GCP
40%
Faster Processing
150+
Business Insights
24/7
Automated Operations

Professional Experience

Career Journey

Global Ecommerce Engineering - Working Student

PUMA GROUP

April 2024 – Present
  • Coordinated project management and technical roadmap for the Order Management System (OMS), unifying strategy across 4+ regions (NA, MENA, LATAM) to deliver increased operational efficiency
  • Maintained GCP infrastructure using Terraform and Docker, and set up CI/CD pipelines to streamline deployments for internal projects
  • Assisted in API testing and data mapping for the integration of key enterprise platforms including Salesforce Commerce Cloud, SAP S/4HANA, Adyen, and local WMS systems
  • Built an internal reporting tool on GCP that automated data aggregation, reducing manual tasks for regional e-commerce teams
  • Coordinated SSO enablement for the global Fluent Commerce OMS, working with stakeholders to configure and verify user access
  • Scripted an automated testing framework using Playwright and Python, hosted on Google Cloud Run, improving UAT coverage and saving hours of testing

Analyst - Software & Information Systems

BANK AL HABIB LIMITED

Aug 2022 – Sep 2023
  • Modernized legacy banking system with microservices architecture using Java Quarkus
  • Achieved 30% reduction in deployment time and improved system scalability
  • Built full-stack applications using ASP.NET Core, Vue.js with TypeScript, and Next.js
  • Developed RESTful APIs for secure system communication
  • Delivered analytical reports using IBM Cognos and Power BI
  • Improved reporting efficiency by 40% with data-driven insights

Specialist - Platform Development

THRIFLE TECHNOLOGIES

Aug 2021 – July 2022
  • Defined KPIs and performed user behavior analysis using Google Analytics
  • Created wireframes for platform development and UI/UX design
  • Wrote product descriptions to support go-to-market readiness
  • Supported data-informed product decisions for startup growth

QMS Intern

PTCL GROUP (UFONE) - EXPERIA INTERNSHIP PROGRAM

July 2021 – September 2021
  • Overlooked the BPO services offered by Ufone to its corporate clients in the QMS Team
  • Learnt about KPIs and ensured that SOPs are being followed to achieve those KPIs
  • Visited Satellite Stations & Data Centers, and learnt about telecom network architecture
  • Gained insights into telecommunications infrastructure and quality management systems

SQA Intern

AFINITI

August 2020 – September 2020
  • Tested multiple software applications to identify and analyze bugs at the front end (Black Box Testing)
  • Managed, tracked, and organized data for comprehensive test cases
  • Helped increase the efficiency of software applications through rigorous testing procedures
  • Contributed to quality assurance processes and bug reporting workflows

Featured Projects

Technical Achievements

Perspective-Based Guardrails for Multi-Agent Systems

Thesis

Designed and implemented a 6-layer defense-in-depth guardrail architecture enforcing reliable, role-scoped information flow within a multi-agent system using a real-world supply chain dataset (DataCo Global, 180,519 orders, 909K synthetic messages). Built a LangGraph 6-node agent pipeline with LLM-powered intent classification, SQL generation, and answer synthesis via Gemini with 3-provider fallback . Agents query structured data via PostgreSQL perspective views and semantic data via ChromaDB vector store (568K documents across Sales + Logistics partitions). Achieved 100% guardrail true positive rate , 80% LLM routing accuracy , and sub-second guardrail enforcement latency (P50: 50.75ms).

LangGraph Multi-Agent AI Gemini PostgreSQL ChromaDB Python Docker LLM NLP Supply Chain Security

Agentic AI & LLM Based Analytics

Live

Architected an enterprise-grade Agentic AI data analytics platform leveraging Large Language Models (LLMs) for autonomous data analysis and insights generation. Implemented multi-agent reasoning architecture with context-aware processing, enabling natural language data queries and automated visualization generation. Built with TensorFlow deep learning for anomaly detection using autoencoder neural networks, 3D PCA dimensionality reduction , and intelligent parsing supporting multiple file formats (CSV, Excel). Features 8+ interactive chart types with Plotly, domain-specific business insights, and a conversational AI assistant enabling real-time exploratory analysis through natural dialogue. Deployed with Docker containerization and optimized caching for production-grade performance.

Agentic AI LLM TensorFlow Deep Learning Python Streamlit Plotly Pandas NumPy Scikit-learn NLP Docker

Intelligent Supply Chain Digital Twin

Live

Engineered a comprehensive end-to-end data engineering platform for supply chain optimization combining automated data ingestion , SQL transformations with dbt and 8 production-grade ML models for predictive analytics. Implemented XGBoost classification with SHAP explainability to predict late deliveries, Prophet time series forecasting for demand planning, and K-Means clustering for network optimization. Integrated causal inference modeling (OLS) , safety stock optimization formulas, market basket analysis and discrete-event simulation (SimPy) for scenario testing. Architected with Apache Airflow orchestration for automated daily pipelines, PostgreSQL data warehouse with staging and marts layers, and interactive Jupyter analytics notebooks . Deployed with Docker containerization enabling reproducible, production-grade performance across environments.

Data Engineering Machine Learning Apache Airflow dbt PostgreSQL XGBoost Prophet Python Pandas Scikit-learn SimPy SQLAlchemy Docker Supply Chain Analytics Optimization

IoT Logistics Analytics Platform

Academic

Engineered a comprehensive data analytics platform to address operational disruptions in automated warehouse shuttle (AKL) logistics at airport facilities. Built a modular Python-based data extraction pipeline using Design Science Research Methodology (DSRM) to parse heterogeneous semi-structured log streams and reconstruct complete, timestamped box journeys across conveyor, lift, and buffer stations. Developed ML-driven failure point localization and delay pattern detection models identifying 3,000+ potential operational delays and surfacing systemic idle wait clusters at conveyor merge zones as critical throughput bottlenecks. The platform delivered real-time operational transparency enabling early intervention workflows and data-driven preventive maintenance strategies.

Python Data Engineering ML IoT ETL

Terraform IaC & CI/CD

Internal

Maintained GCP infrastructure using Terraform and Docker for internal e-commerce projects. Set up CI/CD pipelines to streamline deployments, covering environment promotion (dev→staging→prod) and automated validation checks. Improved deployment reliability and consistency across internal projects while maintaining infrastructure as code best practices.

Terraform GCP Docker CI/CD Automation

Global OMS Reporting Tool

Internal

Built an internal reporting tool on GCP that automated data aggregation from multi-region OMS datasets (orders, invoices, inventory, credit memos, product audit trails). Implemented role-based access controls and dashboards, reducing manual tasks for regional e-commerce teams.

Full-Stack GCP Load Balancer Reporting

Hyperspectral Super Resolution

Academic

Implemented an attention-augmented CNN architecture for hyperspectral super-resolution achieving 2× spatial enhancement , ~4 dB PSNR gain and >90% spectral similarity. Ranked 2nd among 20+ capstone projects demonstrating optimized trade-offs between spectral fidelity and reconstruction latency.

Deep Learning TensorFlow Python

Financial Monte Carlo Simulation

Academic

Executed 1,000+ Monte Carlo simulations applying multi-factor discount rate sensitivity and stochastic price curves to derive 10-year NPV distributions for a manufacturing investment case. Built an interactive Power BI executive dashboard surfacing risk bands, sensitivity sliders, and scenario comparatives to accelerate decision cycles.

Monte Carlo Power BI Modeling

Banking Microservices Migration

Professional

Modernized a legacy core banking stack by introducing a microservices architecture using Quarkus (Java) for low-memory JVM services and ASP.NET Core for high-throughput APIs. Implemented event-driven messaging and containerized CI/CD pipelines, reducing deployment time by ~30% while improving horizontal scalability and operational observability.

Microservices Quarkus ASP.NET Core

Skills & Technologies

Technical Expertise

Programming Languages

Python SQL JavaScript TypeScript Java C# C++

Frontend Development

React Vue.js Next.js HTML5/CSS3 Tailwind CSS Bootstrap

Backend Development

Node.js FastAPI Flask ASP.NET Core Java Quarkus Spring Boot Django GraphQL REST APIs

Data Engineering & Orchestration

Apache Airflow dbt Pandas NumPy ETL Pipelines Data Warehousing Streamlit Jupyter Notebook

Database & Cloud

Google Cloud Platform BigQuery PostgreSQL Neo4j MongoDB SQL SQLAlchemy Cloud SQL GCS

AI & Analytics

Google Vertex AI TensorFlow Scikit-learn XGBoost LangChain LLMs (Gemini, OpenAI) GenAI Machine Learning Deep Learning SHAP Power BI Tableau

Enterprise Systems

Salesforce Commerce Cloud SAP ERP Integration Order Management Systems Microservices Jira Confluence

DevOps & Infrastructure

Docker Terraform CI/CD Git GitHub Actions PgAdmin Load Balancing

Visualization & Analysis

Plotly Matplotlib Seaborn Statsmodels Data Visualization Statistical Analysis

Technology Proficiency Matrix

Real-world Experience & Expertise
ML & AI
95%
Cloud & GCP
90%
Data Engineering
95%
DevOps & IaC
85%
Backend Dev
90%
Frontend Dev
80%

ML & AI

Vertex AI TensorFlow Scikit-learn LangChain GPT-4o

Cloud & GCP

Google Cloud BigQuery Cloud Functions GCS Cloud SQL

Data Engineering

Apache Airflow dbt Kafka Pandas Spark

DevOps & IaC

Terraform Docker Kubernetes CI/CD GitHub Actions

Backend Dev

Python Java Quarkus ASP.NET Core FastAPI PostgreSQL

Frontend Dev

React Vue.js Next.js TypeScript Tailwind

Education

Academic Excellence
2024 - 2026

Master of Science in International Information Systems

Friedrich-Alexander University Erlangen-Nürnberg

Specializing in advanced information systems, data analytics, and enterprise architecture. Focus on international business applications and cross-cultural technology implementation.

International Focus Data Analytics Enterprise Systems

Master's Thesis (In Progress)

Perspective-Based Guardrails for Reliable Information Flow in Multi-Agent Systems

Designed and implemented a 6-layer defense-in-depth guardrail architecture that enforces role-scoped information flow within a LLM-powered multi-agent system. Built on a real-world supply chain dataset (DataCo Global, 180,519 orders) with 909K synthetic communication messages, the system uses LangGraph agent orchestration querying structured (PostgreSQL) and unstructured (ChromaDB) data through perspective-enforced access controls.

100%
Guardrail True Positive Rate
80%
LLM Routing Accuracy
6
Defense-in-Depth Layers
50ms
Guardrail Query P50
LangGraph PostgreSQL ChromaDB Gemini Multi-Agent AI Python
2018 - 2022

Bachelor of Science in Engineering Sciences

Ghulam Ishaq Khan Institute of Engineering Sciences and Technology

Gold Medal

Graduated with Gold Medal for exceptional academic performance. Strong foundation in engineering principles, mathematics, and computer science with focus on software engineering.

Gold Medal Achievement Software Engineering Mathematics & Analytics

Bachelor's Thesis

Attention-Based CNN for Hyperspectral & Multispectral Image Fusion

Architected a novel attention-based Convolutional Neural Network (CNN) in TensorFlow to solve the problem of low-resolution sensor data, successfully fusing hyperspectral and multispectral images for enhanced analysis. Achieved a 2× spatial resolution enhancement that outperformed standard interpolation methods, boosting image PSNR by nearly ~4 dB while critically maintaining over 90% spectral similarity for accurate ore classification.

Spatial Enhancement
~4 dB
PSNR Gain
>90%
Spectral Similarity
TensorFlow Deep Learning CNN Python Image Processing

Certifications

Professional Credentials

Transforming Data into Information using Power BI

Supply Chain Talks

Issued May 2022 Credential ID: SCT-00000533
Certified

Data Science Professional Certificate

IBM

Issued Aug 2020 View Certificate
Certified

DevOps Fundamental

EdYoda Digital University

Issued Jul 2020 View Certificate
Certified

Elements of Artificial Intelligence

University of Helsinki

Issued Jul 2020 View Certificate
Certified

Introduction to Business Management

King's College London

Issued Jun 2020 View Certificate
Certified

Virtual Experience Program

Microsoft

Issued Apr 2020 Credential ID: rEB5PaJsKh4sxMXPL
Certified

Initialize Connection

Let's Engineer Solutions Together
agentic_ai_collaboration.py
01 class CollaborationAgent:
02 def __init__(self):
03 self.expertise = ["ML Engineering", "Data Engineering"]
04 self.tech_stack = ["GCP", "Vertex AI", "Terraform"]
05 self.available = True
06
07 def connect(channel):
08 return "Ready to build intelligent solutions!"|
CONNECTED

Professional Network

linkedin.com/in/ghufranakbar

Network: 500+ Connections
Connect via LinkedIn
AVAILABLE

Geographic Node

Nuremberg, Germany

Timezone: CET/CEST
On-site Collaboration Ready
PROCESSING

AI Collaboration Engine

Intelligent Project Matching

Match Score: 98%
View Compatibility Matrix
5 Industries
10+ Tech Stacks
2 Continents
24/7 Innovation

Tic Tac Toe vs AI

Your turn! Click a square to play.

You: 0 | AI: 0