Mustafa Shoukat

Data Scientist & ML Architect

+92 (309) 360 9261 | Lahore, Pakistan

mustafashoukat.ai@gmail.com | LinkedIn | GitHub | Kaggle

Experience


Data Scientist @ COMET Estimating LLC
10/2023 - Present
  • Over one year at COMET Estimating LLC I have led data analysis and ML, DL, NLP, GENAI Modeling and Chatbots projects for strategic business decision.
  • Translate business problems into data science problems. Providing actionable insights and recommendations.
  • Conducting presentations and demonstrations of data science solutions to clients.
  • I develop and implement advanced algorithms including ML, DL NLP and GenAI models.
Business Development & Sales Representative @ COMET Estimating LLC
2022 - 09/2023
  • Identified new business opportunities, resulting in increased revenue and market share for the company.
  • Identifying and reaching out to potential clients. Presenting services to clients. Closing sales deals. Maintain relationships with existing clients. Meeting or exceeding sales targets.

Education & Certification


Bachelor of Computer Science
2023-2027

Virtual University, Lahore

Courses: C++ & Java Programming, Calculus, Linear Algebra, Differential equation, OOP, Data structure and Algorithms.

Nanodegree in AI & ML
08/2023 - 4/2024

Coursera and IBM

Courses: Python, Statistics, Data Science, EDA, Supervised and Unsupervised learning, ML, DL, NLP, and GenAI.

Projects


  • NewsBOT: News Research, News Summarization and Question and Answering Tool
  • Intelligent Conversational Agents Chatmodel with LangChain & RAG
  • Fine-Tuning, PEFT, QLoRA-on-LLaMA-3
  • Dog breeds Classification via DL and Transfer Learning (Computer Vision)
  • Disaster vs Non-Disaster Tweets Classification (NLP)
  • WiDS-24-2 Exploring Equity in Healthcare (Kaggle Competition)
  • Bitcoin Price Predictions 2014-2024-Analysis of Market Trends & Forecasts (Time Series)
  • Emotional Expressions Analysis from Tweets (NLP)
  • Google -GOOG- Stock Prices Prediction_24 (Time Series)
  • Decoding Billionaire Trends Unleashing Kernel PCA

Technical Skills


Languages: Python, Java, SQL

Data Skills: Structured & Unstructured data Cleaning, Handling, Preprocessing and Modeling.

AI Skills: ML, DL, NLP, Transformers, GENAI, RAG, Advance Transfer learning for LLMs.

Tools/Platforms: GitHub, Colab, VS Code, Kaggle, OpenAI, Hugging Face, Langchain and wandb.

Achievements


  • KFUEIT University English Speech Competition - University-wide Certified Winner 2023.
  • Best Performance Award from COMET Estimating LLC (07/2024).
  • AI Enthusiasts Training 50+ students in Data Science, 80+ Students in Marketing and Customer Care.
  • WIDS Dathon 2024 #2 ranked in the top 12% on the leaderboard (Kaggle Competition).
  • Certified Data Scientist from IBM and Coursera 06/2023-04/2024.
  • Solved 30+ Data Science problems on LeetCode.
  • Kaggle Notebook Expert.

Ready to discuss your project requirements?

Get In Touch