ML Engineer | Edge-AI Developer | UAE Golden Visa Holder

Daniel Gebre

Machine Learning Engineer specializing in LLM optimization, distributed ML systems, and deploying AI solutions at scale. Building the future of intelligent applications.

About

AI Research

Leading-edge research in LLM optimization and distributed ML systems

Professional

ML Engineer specializing in production-ready AI solutions at scale

Innovation

UAE Golden Visa holder recognized for exceptional talent in AI technology

Daniel is an Applied Scientist at Inception, a G42 company based in Abu Dhabi, UAE. His work revolves around Agentic RAG systems, AI agent evaluations, and Edge AI - building scalable machine learning systems that bridge the gap between theoretical advancements and practical applications.

He holds an MSc in Machine Learning from MBZUAI and a BSc in Information Technology from Zayed University. Previously, he worked as a Machine Learning Engineer at Technology Innovation Institute/AICCU center, contributing to several cutting-edge projects in LLMs, RAG systems, model compression, and edge inference optimization.

Daniel is a builder at heart: he enjoys turning theory into working AI-native systems. As a UAE Golden Visa holder, he is passionate about contributing to innovative AI projects that make a real impact, and is always eager to connect with like-minded professionals and organizations at the forefront of technology and research.

When he's not playing with models, you'll find him on epic phone calls with family or lost in football—kicking it on the field or glued to the game on TV! ⚽

G42 Applied Scientist
TII ML Engineer
MBZUAI MSc ML - GPA 3.8
ZU BSc IT - GPA 3.9

Projects

🧠

Evaluation Framework for Agentic RAG

Built a comprehensive evaluation framework using DeepEval, RAGAS, and TruLens with 50+ metrics across 4 specialized agents.

50+ Metrics Real-time Dashboard
DeepEval RAGAS TruLens Streamlit
📱

LLM Edge Device Optimization

Built an LLM compression pipeline combining depth and width pruning, followed by healing via LoRA fine-tuning. Achieved a 19% size reduction while retaining 96% of the original performance.

19% Size Reduction 96% Performance
LoRA Pruning Flutter Quantization
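As a rough illustration of the two pruning styles named above, here is a minimal pure-Python sketch on a toy model. It is not the project's pipeline: `prune_depth`, `prune_width`, `count_params`, and the toy weights are all hypothetical, the L2-norm ranking is just one common importance heuristic, and the LoRA healing step is not shown.

```python
def prune_depth(layers, drop_indices):
    """Depth pruning: remove whole layers by index."""
    drop = set(drop_indices)
    return [layer for i, layer in enumerate(layers) if i not in drop]


def prune_width(weight_rows, keep_ratio):
    """Width pruning: keep the rows (neurons) with the largest L2 norm.

    Simplification: in a real network the next layer's input columns
    would have to be trimmed to match; this toy treats each layer alone.
    """
    n_keep = max(1, int(len(weight_rows) * keep_ratio))
    norms = [sum(w * w for w in row) ** 0.5 for row in weight_rows]
    ranked = sorted(range(len(weight_rows)), key=lambda i: norms[i], reverse=True)
    keep = sorted(ranked[:n_keep])  # preserve the original neuron order
    return [weight_rows[i] for i in keep]


def count_params(layers):
    """Total number of weights across all layers."""
    return sum(len(row) for layer in layers for row in layer)


# Toy model: three layers, each a 4x2 weight matrix (values are made up).
model = [
    [[0.5, -0.4], [0.01, 0.02], [0.9, 0.3], [-0.02, 0.01]],
    [[0.1, 0.1], [0.7, -0.6], [0.03, 0.0], [0.4, 0.5]],
    [[0.2, 0.2], [0.0, 0.05], [0.6, 0.1], [0.3, -0.8]],
]

# Drop the middle layer, then keep the strongest half of each remaining layer.
pruned = [prune_width(layer, keep_ratio=0.5)
          for layer in prune_depth(model, drop_indices=[1])]
```

The toy numbers bear no relation to the 19% and 96% figures reported for the project; they only show how depth and width pruning shrink a parameter count in different ways.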
☁️

AWS RAG System with Auto-Evaluation

Deployed RAG systems (RAGFlow and Dify) on AWS with API integration. Automated evaluation reached 95% faithfulness.

95% Faithfulness Auto-Evaluation
AWS RAGFlow Dify Llama 3.1
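Faithfulness is commonly defined as the share of claims in a generated answer that the retrieved context supports. The sketch below shows that ratio under loud assumptions: `faithfulness` and `naive_judge` are hypothetical names, the substring check stands in for the LLM judge a real evaluator would use, and none of this is the API of RAGAS or any other library.

```python
def faithfulness(claims, context, is_supported):
    """Fraction of claims judged supported by the context (0.0 if no claims)."""
    if not claims:
        return 0.0
    supported = sum(1 for claim in claims if is_supported(claim, context))
    return supported / len(claims)


def naive_judge(claim, context):
    """Hypothetical stand-in for an LLM judge: case-insensitive substring match."""
    return claim.lower() in context.lower()


# Toy data, made up for illustration only.
context = "The report was filed on Monday. It covers three regions."
claims = [
    "the report was filed on monday",
    "it covers three regions",
    "it was approved by the board",  # not stated in the context
]

score = faithfulness(claims, context, naive_judge)  # 2 of 3 claims supported
```

Passing the judge in as a function keeps the ratio itself separate from how support is decided, so the naive check could be swapped for a model-based one without touching the metric.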
📲

Flutter Mobile LLM App

Developed a Flutter mobile app that runs quantized .gguf LLM models locally on mobile devices with efficient on-device inference.

On-Device Low Latency
Flutter GGUF Mobile AI Quantization
🔧

BERT Parallel Architecture Fine-tuning

Implemented data and tensor parallelism with DeepSpeed on 4 H100 GPUs, reducing training time and increasing throughput by 34%.

34% Faster 4x H100
BERT DeepSpeed H100 Parallel Training
🔍

LLM-Powered Semantic Search Engine

Developed a search engine utilizing LLM and vector space models for efficient document retrieval with advanced semantic understanding.

Vector Search Semantic
LLM Vector DB Embeddings Search
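The retrieval step behind a vector-space search engine can be sketched as ranking documents by cosine similarity to a query embedding. This is a minimal illustration, not the project's implementation: `cosine_similarity`, `search`, and the 3-dimensional toy vectors are assumptions, and a real system would obtain embeddings from a model and store them in a vector database.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)


def search(query_vec, doc_vecs, top_k=2):
    """Return the top_k (doc_id, score) pairs, best match first."""
    scored = [(doc_id, cosine_similarity(query_vec, vec))
              for doc_id, vec in doc_vecs.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy embeddings, made up for illustration only.
DOC_VECS = {
    "pruning_notes": [0.9, 0.1, 0.0],
    "football_recap": [0.0, 0.2, 0.9],
    "lora_guide": [0.8, 0.3, 0.1],
}

results = search([1.0, 0.2, 0.0], DOC_VECS)
```

A brute-force scan like this is fine for a handful of documents; at scale, an approximate nearest-neighbor index would normally replace the full sort.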

Expertise

🤖
AI Foundation

Mathematical foundations, Statistics, Linear Algebra, Probability Theory

🧠
LLM Architectures

Transformer architectures, attention mechanisms, positional encodings, multi-head attention, feed-forward networks, layer normalization techniques

Transformers Attention LoRA/QLoRA PEFT
Parallel Computing

CUDA, GPU Programming, Data/Tensor/Model/Pipeline parallelism, Multi-GPU optimization

CUDA DeepSpeed FairScale Horovod
📊
Deep Learning

Neural Networks, CNN, RNN, Transformers, Optimizers, Hyper-parameter Tuning, Model Architecture Design

PyTorch TensorFlow JAX Optuna
🛠️
MLOps & Infrastructure

MLflow, W&B, Docker, Airflow, FastAPI/Flask, AWS SageMaker, automated ML pipelines

MLflow Docker Kubernetes AWS
🚀
AI Applications

Production-ready AI systems, Edge deployment, Agentic systems, Real-world impact

AutoGen CrewAI RAG Edge AI

Honors & Achievements

A journey of continuous learning and recognition in AI & Technology

🎓 Scholarship
Academic Excellence July 2019

Merit Scholarship Recipient

UAE Ministry of Presidential Affairs

Awarded prestigious merit scholarship for outstanding academic performance and leadership potential in technology and innovation.

💎 Elite Recognition
🌐 Certification
Professional December 2021

CCNA Certified Professional

Cisco Systems

Achieved Cisco Certified Network Associate certification, demonstrating expertise in network fundamentals and infrastructure.

🔧 Industry Standard
🏆 Award
Research Excellence December 2022

IEEE Best Paper Award

IEEE Metaverse Conference 2022

Recognized for outstanding research contribution in metaverse technologies and virtual environment security.

📚 Research Impact
🧠 Graduate
Academic Excellence July 2023

Merit Graduate Scholarship

Mohamed bin Zayed University of AI

Awarded graduate scholarship for exceptional performance in AI and machine learning studies at the world's first AI university.

🤖 AI Excellence
📊 Specialization
Specialization September 2024

Deep Learning Specialization

DeepLearning.AI

Completed comprehensive deep learning specialization covering neural networks, CNN, RNN, and advanced architectures.

🔬 Technical Mastery
Internship
Industry Experience October 2024

AI Engineer Internship

Technology Innovation Institute

Successfully completed advanced AI engineering internship, contributing to cutting-edge research and development projects.

🚀 Innovation
8 Achievements
6 Years
1 Patent
100% Excellence

Experience

Jun 2025 - Present

Applied Scientist - Investment AI Team

Group 42/Inception

Developing, evaluating, and deploying Agentic AI models for real-world financial applications. Leading development of domain-specific Finance LLM evaluation frameworks while coordinating with academic and industry partners.

AutoGen CrewAI RAGAS DeepEval TruLens
Jun 2024 - Dec 2024

AI Engineer - Falcon Modeling Team

Technology Innovation Institute, Abu Dhabi

Optimized Falcon model inference throughput by up to 22x using pipeline parallelism, CUDA graph optimization, and kernel fusion. Built and deployed a Falcon RAG system and a Flutter mobile application for edge LLM deployment.

Falcon Models CUDA Pipeline Parallelism Flutter AWS
Mar 2022 - Aug 2023

Blockchain Developer

MBZUAI, Abu Dhabi

Designed comprehensive authentication frameworks spanning multiple technologies. Built integrated systems connecting mobile applications, blockchain, federated learning, and 3D environments. Won IEEE Best Paper Award at IEEE Metaverse-2022 Conference.

Blockchain Federated Learning Authentication Systems 3D Environments
Jan 2022 - Feb 2022

Data Science Intern

Ureka Education Group, Abu Dhabi

Applied data processing techniques including EDA, cleaning, normalization, and feature engineering. Used PCA-based visualization to turn raw data into actionable insights.

Data Processing Feature Engineering PCA Data Visualization

What Colleagues Say

"Working with Daniel has been transformative for our AI initiatives. His deep understanding of distributed systems and edge deployment has accelerated our product development significantly."

Prof. Mohsen Guizani

Distinguished Professor, MBZUAI

"Daniel demonstrated exceptional technical skills and innovation during his time with us. His contributions to LLM optimization, RAG systems, and edge AI deployment were instrumental in advancing our research initiatives."

Dr. Hakim Haci

Chief Researcher, TII/AICCU

Let's Build the Future of AI

Ready to push the boundaries of what's possible with machine learning and AI? Let's collaborate on innovative projects that make a real impact.

📍 Location: Abu Dhabi, UAE

🏆 Status: UAE Golden Visa Holder

🎓 Currently: Applied Scientist at Inception G42