Available for full-time roles

Tarek
Badih

Data scientist and ML engineer who ships real systems — from computer vision classifiers to NLP pipelines — and makes models do things that matter.

1
Startup Launched
22%
Engagement Lift
40%
Review Time Cut
92%
Model Accuracy
2
Internships

Building AI that actually ships

I'm a UC Berkeley Data Science grad (Dec 2025) with a focus on applied ML — the kind that lives in production, not just notebooks. I've deployed computer vision systems, NLP pipelines, and recommendation engines that drove measurable outcomes at real organizations.

My sweet spot is the intersection of modeling and engineering: I care about how a model is built, but I care just as much about how it gets integrated, monitored, and made useful for people who aren't data scientists.

Currently seeking full-time ML Engineer or Data Scientist roles where I can build systems at scale.

[email protected]

What I Work With

Skills & Stack

Tools and techniques I've used on real projects — not just tutorials.

🧠

Machine Learning

scikit-learn, Random Forests, Gradient Boosting, A/B Testing, Feature Engineering

👁️

Computer Vision

Image classification pipelines, TensorFlow, PyTorch, production deployment

💬

NLP & LLMs

Sentiment analysis, prompt engineering, generative AI integration, Claude API

⚙️

Data Pipelines

Python, NumPy, pandas, SQL, automated ETL, daily delivery cadences

📊

Analytics & Viz

Matplotlib, Seaborn, audience segmentation, dynamic pricing, market analysis

🛠️

Engineering

Python, Java, SQL, C, Git, Jupyter, real-time model integration, API deployment

Career

Experience

Where I've built things and what came out of it.

Apr 2026 – Present
Founder
SignalForge · San Jose, CA
  • Built and shipped SignalForge — a resume market intelligence tool that maps live U.S. job opportunities using local AI embeddings, TF-IDF, and cosine similarity scoring.
  • Designed Signal Paths, a strategy layer that converts a ranked shortlist into three next-move cards: Ready Now, Stretch Next, and Unlock More.
  • Deployed full-stack on Vercel with Flask backend, transparent ranking explanations on every job card, and live filters for role, source, fit band, and mode.
PythonFlaskscikit-learn FastEmbedNLPVercel
Apr 2025 – Aug 2025
AI & Machine Learning Intern
Helping Hands · Berkeley, CA
  • Trained recommendation models that increased user engagement by 22% and improved equitable access to nonprofit resources.
  • Built automated end-to-end data pipelines (ingestion → cleaning → feature engineering → model training) delivering analyst-ready data daily.
  • Deployed a computer vision classifier that cut manual donation image review time by 40%.
  • Integrated NLP sentiment models (Claude API) into live outreach workflows, improving donor response rates in real time.
TensorFlowComputer VisionNLP PythonNumPyClaude API
May 2024 – Aug 2024
Data Science Intern
BLCK UNICRN · Los Angeles, CA
  • Built audience segmentation + dynamic pricing model for VR experiences → 12% ticket sales increase.
  • Ran A/B tests on promotional packages that lifted average ticket revenue by 8%.
  • Trained Random Forest and Gradient Boosting models achieving 92% accuracy on user behavior forecasting.
PythonSQLRandom Forests Gradient BoostingA/B TestingJava
Jun 2022 – Jul 2022
AI & Data Hackathon Participant
MIT Hackathon · Cambridge, MA
  • Built humanitarian tech prototypes using Python and applied AI to automate aid-distribution data analysis for NGOs.
  • Delivered functional prototypes and full data analysis on schedule under competition constraints.
PythonpandasApplied AI

Work

Projects

Side projects and competition work that pushed me further than internships did.

SignalForge Live

Resume market intelligence tool that maps live U.S. opportunities using local AI embeddings and semantic scoring. Features Signal Paths — three strategic next-move cards built from the shortlist.

🔗 Live App GitHub
🚌

Transit Route Optimization

Implemented slime mold network simulations in Python to optimize Berkeley transit routes. Integrated 3D city models with biological network algorithms to propose sustainable transport strategies.

Biodesign Challenge 2024
🚁

AI Food Logistics System

Led AI-driven routing and demand prediction for a drone-based food delivery prototype. Used clustering algorithms and shortest-path heuristics with a built-in NLP assistant for stakeholder insights.

🏆 3rd Place — ENJAZ Tech Competition
💬

Donor Sentiment Pipeline

End-to-end NLP system integrating Claude API to analyze donor messages at scale, feeding a real-time outreach decision engine for a nonprofit platform.

Production — Helping Hands
👁️

Donation Image Classifier

Computer vision pipeline deployed in a live nonprofit intake workflow. Reduced manual image review time by 40% through preprocessing optimization and batched inference.

40% time reduction in production

Let's Talk

Get In Touch

Open to full-time ML Engineer and Data Scientist roles. If you're building something interesting, I'd love to hear about it.