Who am I

👋 Hi, my name is Mohammed Omar Khan and I live in the UAE.

👨‍💻 I'm an aspiring Data Scientist with a passion for analysing and working with data.

💻 I am Microsoft certified in Azure Data and Oracle Certified Foundations Associate in Java.

🤖 Check out my portfolio under Projects or on my portfolio page.

Skills

C
SQL
Java
Python
DAX
C++
HTML
JavaScript
CSS
Bash Scripting
NLP
pandas
Numpy
Scikit-learn
TensorFlow
Keras
Matplotlib
Keras
Git
Bash
PowerBI
Tableau
Excel
ElasticSearch
Postman
Jupyter
Hadoop
Node.js

Education & Experience

For more information, have a look at my CV.

  • March 2024 - Present - Abu Dhabi Police
    Project Engineer
    Solaris Bash Elasticsearch Openshift
  • May 2023 - Present - Midis Group
    TSS Intern
    Oracle Databases Hadoop Hive SQL
  • Sep 2022 - May 2023 - Heriot-Watt University
    Intern
    Python SQL Java Data Science
  • B.S. Computer Science (with Artificial Intelligence)
    GPA - 3.7/4.0 Deputy' Principal's Award

Projects

In this project, we scrape data from Bayut.com to create a dataset of over 50 thousand listings to analyse real estate prices in Dubai. We visualized the average prices for each area using PowerBI and used this report to identify the overall market rates by neighborhood.

Python pandas Web Scraping PowerBI

The aim of this project is to create a machine learning model that can classify URLs as ads or trackers.To this end, we have tried various machine learning algorithms to find the most promising algorithm for this project.

Python Machine Learning Random Forests XGBoost SVM Logistic Regression LightGBM

This project aims to visualise and analyse the effect of certain promotions on customer conversion. Using this information, the website can identify users with the most conversion potential and can understand the effects of different types of offers.

Python Machine Learning XGBoost seaborn GridSearch Logistic Regression

Analysis of dataset containing over 10,000 tweets to identify the sentiment expressed with the help of natural language processing, using Tensorflow and graphs to provide visual representation of collected data, achieving a sentiment classification accuracy of 92% using various Machine Learning algorithms such as neural networks, logistic regression, XGBoost and Random forests.

Neural Networks NLP Logistic Regression Random Forests

Analysis of dataset containing over 227,000 Amazon reviews to identify customer satisfaction with the product with the help of natural language processing, using Tensorflow and graphs to provide visual representation of the data, achieving sentiment classification of 90% and identified overall customer satisfaction with various products using various Machine Learning algorithms such as neural networks, regression, XGBoost , LightGBM, Random forests, SVM classifiers and hyper parameter tuning.

Neural Networks NLP Logistic Regression Random Forests

Contact

📧 Email: mkhan0138@gmail.com

☎️ Mobile: +971 56 286 1951