Skip to main content

Who Am I?

ML Engineer & Data Engineer

The Journey

From a Gold Medal engineering graduate to a global tech innovator - my journey spans across continents and industries. Currently pursuing M.Sc. International Information Systems in Germany while revolutionizing e-commerce at PUMA GROUP.

The Expertise

I develop enterprise-scale solutions and am passionate about exploring intelligent AI systems that can transform business processes. Currently building AI-powered analytics platforms that deliver operational efficiency gains while exploring agentic AI and process analytics as future career paths. I specialize in end-to-end data engineering pipelines, machine learning model deployment using Vertex AI, and scalable cloud architectures with modern tools like dbt, Airflow, and BigQuery. Aspiring to advance into ML Engineering and Data Engineering roles, I'm building expertise in autonomous CI/CD pipelines with Terraform, microservices architectures, and intelligent process optimization - turning complex technical challenges into elegant, performance-driven solutions that scale globally.

The Impact

My solutions deliver measurable business value, and I'm driven by the potential of intelligent automation. With a proven track record of 98% UAT success rates and 40% operational efficiency gains, I'm building expertise that bridges traditional software engineering with cutting-edge ML and data engineering practices. Through hands-on experience with Vertex AI, BigQuery, dbt, and Airflow, along with cloud computing platforms, Terraform scripts, and CI/CD pipelines, I'm developing the skills needed to advance into roles where I can architect intelligent systems that transform entire business processes. My goal is to pioneer the next generation of agentic AI solutions across Banking, E-commerce, Sports & Lifestyle, and Telecommunications sectors.

Tech Arsenal

Languages

Python Java C# C++ JavaScript TypeScript SQL

Frameworks

React Vue.js Next.js Flask Quarkus Spring Boot

Cloud & AI

Google Cloud Vertex AI Agentic AI GenAI BigQuery Azure Terraform

Data & Analytics

Power BI Google Looker Pandas NumPy Scikit-learn Apache Spark Process Analytics

Data Engineering

Apache Kafka dbt Apache Airflow ETL Pipelines Data Warehousing MongoDB PostgreSQL Redis

DevOps & CI/CD

Terraform CI/CD Pipelines Docker Kubernetes GCP DevOps Automated Testing

Academic Excellence

Gold Medallist

Global Experience

5 Companies3+ Years

Performance Impact

40% Efficiency98% Success Rate

Future Focus

Agentic AIProcess Analytics

Solving Enterprise Data Challenges

Unified Data Collection

Consolidating fragmented data streams into a single source of truth

Kafka GCS APIs

Scalable Analytics

Enabling complex queries on petabytes of data in seconds

BigQuery SQL

Data Quality & Consistency

Ensuring reliable, tested data models across the organization

dbt Jinja

Intelligent Automation

Self-healing pipelines that scale with business growth

Airflow DAGs

Predictive Intelligence

Turning data into business foresight with production ML

Vertex AI MLOps

Zero-Downtime Delivery

Ensuring 24/7 availability with automated rollback & monitoring

Terraform GCP
40%
Faster Processing
150+
Business Insights
24/7
Automated Operations

Professional Experience

Career Journey

Global Ecommerce Engineering - Working Student

PUMA GROUP

April 2024 – Present
  • Building AI-powered analytics platforms using Vertex AI, BigQuery, dbt, and Airflow
  • Implementing end-to-end data engineering pipelines for global e-commerce operations
  • Deploying machine learning models with CI/CD automation using Terraform
  • Achieving 98% UAT success rate and 40% operational efficiency gains
  • Developing scalable cloud architectures on Google Cloud Platform
  • Researching and implementing Agentic AI and process analytics solutions

Analyst - Software & Information Systems

BANK AL HABIB LIMITED

Aug 2022 – Sep 2023
  • Modernized legacy banking system with microservices architecture using Java Quarkus
  • Achieved 30% reduction in deployment time and improved system scalability
  • Built full-stack applications using ASP.NET Core, Vue.js with TypeScript, and Next.js
  • Developed RESTful APIs for secure system communication
  • Delivered analytical reports using IBM Cognos and Power BI
  • Improved reporting efficiency by 40% with data-driven insights

Specialist - Platform Development

THRIFLE TECHNOLOGIES

Aug 2021 – July 2022
  • Defined KPIs and performed user behavior analysis using Google Analytics
  • Created wireframes for platform development and UI/UX design
  • Wrote product descriptions to support go-to-market readiness
  • Supported data-informed product decisions for startup growth

QMS Intern

PTCL GROUP (UFONE) - EXPERIA INTERNSHIP PROGRAM

July 2021 – September 2021
  • Overlooked the BPO services offered by Ufone to its corporate clients in the QMS Team
  • Learnt about KPIs and ensured that SOPs are being followed to achieve those KPIs
  • Visited Satellite Stations & Data Centers, and learnt about telecom network architecture
  • Gained insights into telecommunications infrastructure and quality management systems

SQA Intern

AFINITI

August 2020 – September 2020
  • Tested multiple software applications to identify and analyze bugs at the front end (Black Box Testing)
  • Managed, tracked, and organized data for comprehensive test cases
  • Helped increase the efficiency of software applications through rigorous testing procedures
  • Contributed to quality assurance processes and bug reporting workflows

Featured Projects

Technical Achievements

Agentic AI & LLM Based Analytics

Live

Architected an enterprise-grade Agentic AI data analytics platform leveraging Large Language Models (LLMs) for autonomous data analysis and insights generation. Implemented multi-agent reasoning architecture with context-aware processing, enabling natural language data queries and automated visualization generation. Built with TensorFlow deep learning for anomaly detection using autoencoder neural networks, 3D PCA dimensionality reduction , and intelligent parsing supporting multiple file formats (CSV, Excel). Features 8+ interactive chart types with Plotly, domain-specific business insights, and a conversational AI assistant enabling real-time exploratory analysis through natural dialogue. Deployed with Docker containerization and optimized caching for production-grade performance.

Agentic AI LLM TensorFlow Deep Learning Python Streamlit Plotly Pandas NumPy Scikit-learn NLP Docker

Intelligent Supply Chain Digital Twin

Live

Engineered a comprehensive end-to-end data engineering platform for supply chain optimization combining automated data ingestion , SQL transformations with dbt and 8 production-grade ML models for predictive analytics. Implemented XGBoost classification with SHAP explainability to predict late deliveries, Prophet time series forecasting for demand planning, and K-Means clustering for network optimization. Integrated causal inference modeling (OLS) , safety stock optimization formulas, market basket analysis and discrete-event simulation (SimPy) for scenario testing. Architected with Apache Airflow orchestration for automated daily pipelines, PostgreSQL data warehouse with staging and marts layers, and interactive Jupyter analytics notebooks . Deployed with Docker containerization enabling reproducible, production-grade performance across environments.

Data Engineering Machine Learning Apache Airflow dbt PostgreSQL XGBoost Prophet Python Pandas Scikit-learn SimPy SQLAlchemy Docker Supply Chain Analytics Optimization

IoT Logistics Analytics Platform

Academic

Engineered a comprehensive data analytics platform to address operational disruptions in automated warehouse shuttle (AKL) logistics at airport facilities. Built a modular Python-based data extraction pipeline using Design Science Research Methodology (DSRM) to parse heterogeneous semi-structured log streams and reconstruct complete, timestamped box journeys across conveyor, lift, and buffer stations. Developed ML-driven failure point localization and delay pattern detection models identifying 3,000+ potential operational delays and surfacing systemic idle wait clusters at conveyor merge zones as critical throughput bottlenecks. The platform delivered real-time operational transparency enabling early intervention workflows and data-driven preventive maintenance strategies.

Python Data Engineering ML IoT ETL

Vertex AI Order Forecasting

Internal

Designed and deployed end-to-end demand forecasting pipelines on Google Vertex AI orchestrated with Apache Airflow and transformation layers in dbt + BigQuery leveraging feature engineering (seasonality extraction, regional normalization, holiday encoding), data quality validation, and automated model selection across ARIMA / XGBoost / DNN regressors. Produced double-digit MAPE reduction improving regional inventory allocation accuracy, reducing stock imbalance risk, and enabling earlier capacity planning signals.

Vertex AI Airflow dbt BigQuery Python Pandas Scikit-learn ML Forecasting GCP

Terraform IaC & CI/CD

Internal

Led PUMA's cloud modernization by authoring reusable Terraform module library covering VPC design (subnets, routing), IAM least-privilege roles, GCS lifecycle policies, Cloud SQL provisioning, service accounts & workload identity federation. Integrated Git-driven CI/CD pipelines with environment promotion (dev→staging→prod), automated policy checks (fmt / validate / plan security review) and drift detection. Delivered >70% faster provisioning (days→hours) while reducing infrastructure cost footprint through rightsizing & reusable patterns and enhancing audit traceability for compliance.

Terraform GCP VPC Cloud SQL IAM CI/CD Cost Optimization Security Automation

Global OMS Reporting Tool

Internal

Developed a full-stack reporting application deployed behind a GCP global HTTP load balancer aggregating multi-region OMS datasets (orders, invoices, inventory, credit memos, product audit trails). Implemented role-based access controls and metric-driven dashboards enabling unified visibility and reducing manual reconciliation effort.

Full-Stack GCP Load Balancer Reporting

Hyperspectral Super Resolution

Academic

Implemented an attention-augmented CNN architecture for hyperspectral super-resolution achieving 2× spatial enhancement , ~4 dB PSNR gain and >90% spectral similarity. Ranked 2nd among 20+ capstone projects demonstrating optimized trade-offs between spectral fidelity and reconstruction latency.

Deep Learning TensorFlow Python

Financial Monte Carlo Simulation

Academic

Executed 1,000+ Monte Carlo simulations applying multi-factor discount rate sensitivity and stochastic price curves to derive 10-year NPV distributions for a manufacturing investment case. Built an interactive Power BI executive dashboard surfacing risk bands, sensitivity sliders, and scenario comparatives to accelerate decision cycles.

Monte Carlo Power BI Modeling

Banking Microservices Migration

Professional

Modernized a legacy core banking stack by introducing a microservices architecture using Quarkus (Java) for low-memory JVM services and ASP.NET Core for high-throughput APIs. Implemented event-driven messaging and containerized CI/CD pipelines, reducing deployment time by ~30% while improving horizontal scalability and operational observability.

Microservices Quarkus ASP.NET Core

Agentic AI Architecture

Multi-Agent Intelligence System
Orchestrator Agent
LangGraph Coordinator
Research Agent
GPT-4o + RAG
Data collection & context gathering
Analysis Agent
GPT-4o + TensorFlow
Statistical analysis & ML insights
Visualization Agent
Plotly + D3.js
Interactive chart generation
Natural Language Queries
Real-time Processing
Context-Aware AI
Enterprise Grade

Skills & Technologies

Technical Expertise

Programming Languages

Python SQL JavaScript TypeScript Java C# C++

Frontend Development

React Vue.js Next.js HTML5/CSS3 Tailwind CSS Bootstrap

Backend Development

Node.js FastAPI Flask ASP.NET Core Java Quarkus Spring Boot Django GraphQL REST APIs

Data Engineering & Orchestration

Apache Airflow dbt Pandas NumPy ETL Pipelines Data Warehousing Streamlit Jupyter Notebook

Database & Cloud

Google Cloud Platform BigQuery PostgreSQL Neo4j MongoDB SQL SQLAlchemy Cloud SQL GCS

AI & Analytics

Google Vertex AI TensorFlow Scikit-learn XGBoost LangChain LLMs (Gemini, OpenAI) GenAI Machine Learning Deep Learning SHAP Power BI Tableau

Enterprise Systems

Salesforce Commerce Cloud SAP ERP Integration Order Management Systems Microservices Jira Confluence

DevOps & Infrastructure

Docker Terraform CI/CD Git GitHub Actions PgAdmin Load Balancing

Visualization & Analysis

Plotly Matplotlib Seaborn Statsmodels Data Visualization Statistical Analysis

Technology Proficiency Matrix

Real-world Experience & Expertise
ML & AI
95%
Cloud & GCP
90%
Data Engineering
95%
DevOps & IaC
85%
Backend Dev
90%
Frontend Dev
80%

ML & AI

Vertex AI TensorFlow Scikit-learn LangChain GPT-4o

Cloud & GCP

Google Cloud BigQuery Cloud Functions GCS Cloud SQL

Data Engineering

Apache Airflow dbt Kafka Pandas Spark

DevOps & IaC

Terraform Docker Kubernetes CI/CD GitHub Actions

Backend Dev

Python Java Quarkus ASP.NET Core FastAPI PostgreSQL

Frontend Dev

React Vue.js Next.js TypeScript Tailwind

ML Workflow DAG

Orchestrated with Apache Airflow
Extract Data
2min
Validate
1min
Transform (dbt)
3min
Feature Eng
2min
Train Model
15min
Evaluate
2min
Test
3min
Deploy to Vertex AI
5min
Monitor
Success
Running
Pending
Failed

Education

Academic Excellence
2024 - 2026

Master of Science in International Information Systems

Friedrich-Alexander University Erlangen-Nürnberg

Specializing in advanced information systems, data analytics, and enterprise architecture. Focus on international business applications and cross-cultural technology implementation.

International Focus Data Analytics Enterprise Systems
2018 - 2022

Bachelor of Science in Engineering Sciences

Ghulam Ishaq Khan Institute of Engineering Sciences and Technology

Gold Medal

Graduated with Gold Medal for exceptional academic performance. Strong foundation in engineering principles, mathematics, and computer science with focus on software engineering.

Gold Medal Achievement Software Engineering Mathematics & Analytics

Certifications

Professional Credentials

Transforming Data into Information using Power BI

Supply Chain Talks

Issued May 2022 Credential ID: SCT-00000533
Certified

Data Science Professional Certificate

IBM

Issued Aug 2020 View Certificate
Certified

DevOps Fundamental

EdYoda Digital University

Issued Jul 2020 View Certificate
Certified

Elements of Artificial Intelligence

University of Helsinki

Issued Jul 2020 View Certificate
Certified

Introduction to Business Management

King's College London

Issued Jun 2020 View Certificate
Certified

Virtual Experience Program

Microsoft

Issued Apr 2020 Credential ID: rEB5PaJsKh4sxMXPL
Certified

Initialize Connection

Let's Engineer Solutions Together
agentic_ai_collaboration.py
01 class CollaborationAgent:
02 def __init__(self):
03 self.expertise = ["ML Engineering", "Data Engineering"]
04 self.tech_stack = ["GCP", "Vertex AI", "Terraform"]
05 self.available = True
06
07 def connect(channel):
08 return "Ready to build intelligent solutions!"|
CONNECTED

Professional Network

linkedin.com/in/ghufranakbar

Network: 500+ Connections
Connect via LinkedIn
AVAILABLE

Geographic Node

Nuremberg, Germany

Timezone: CET/CEST
On-site Collaboration Ready
PROCESSING

AI Collaboration Engine

Intelligent Project Matching

Match Score: 98%
View Compatibility Matrix
5 Industries
10+ Tech Stacks
2 Continents
24/7 Innovation

Tic Tac Toe vs AI

Your turn! Click a square to play.

You: 0 | AI: 0