Jahid Hasan

I'm a Data Scientist

About Me

Hi, I'm Jahid Hasan. As an experienced software engineer with a deep passion for data science, machine learning, and artificial intelligence, I'm dedicated to developing innovative solutions that leverage data to solve complex problems. I have a strong foundation in web application development, database management, and data analysis, with a proven track record in both junior and senior roles in the software industry. Currently, I'm pursuing an MS in Data Science at Eastern University, where I'm expanding my expertise in advanced data analysis, machine learning algorithms, and AI technologies.

At Ludwig Pfeiffer, I enhanced my skills in web application development and database management, working with tools like SQL, Pandas, Power BI, and Excel. My experience also includes leveraging software development tools such as Microsoft Power Apps, SharePoint, Power Automate, and Google AppSheet. During this time, I built web solutions and optimized data processes, gaining valuable insights into automation and workflow efficiencies.

During my time at Tappware Solutions, I focused on data analysis and pipeline development for large-scale e-governance projects. Here, I worked with big data techniques, predictive modeling, and machine learning web APIs. I implemented REST APIs for machine learning models using Django REST Framework and PostgreSQL, solidifying my skills in creating scalable, data-driven applications.

At Qtec Solution, I gained extensive experience with machine learning and statistical techniques, including data mining, sentiment analysis, predictive analytics, and natural language processing. I used tools such as MySQL, Excel, and Python to develop strategic insights for marketing and customer sentiment analysis. My projects often involved web crawling, data pre-processing, and parsing complex HTML and XML documents.

Current Role: Data Science Contributor
Company: Kaggle
Education: MS in Data Science
Kaggle Status: Grandmaster
Location: Maryland, US

Personal Interests

Software Development

Machine Learning

Database Design and Architecture

Large Language Models

Statistical Analysis

Data Science

Problem-Solving Techniques

Generative AI

Research Interests

I'm interested in both research and development. I'm currently doing my graduate thesis work on Big Data Mining, Digital Image Processing, and Artificial Intelligence. I've also listed some topics that are beyond my current expertise; I hope to work on these in the future.

Artificial Neural Network (ANN)

Recurrent Neural Network (RNN)
Convolutional Neural Network (CNN)
Neural Network Optimization
LSTM (Long Short-Term Memory) Network

Computer Vision & Digital Image Processing

Facial and Emotion Recognition
Blob Detection

Digital Signal Processing & Cognitive Science

EEG & EMG Analysis
Speech Recognition
Medical Imaging
Computer Graphics

Natural Language Processing

Native Natural Language Processing Toolkit
Text-based Emotion Analysis
News Analysis using NLP

Deep Learning

Deep Learning using Theano, TensorFlow, and Torch

Big Data Mining and Cloud Computing

Distributed Data Processing
Scalable Machine Learning Algorithms
Cloud-based Analytics

Internet of Things (IoT)

IoT Security and Privacy
Smart Systems and Automation
Sensor Networks and Edge Computing

Education

Eastern University

MS in Data Science

January 2024 - Present | In Progress
Relevant Coursework
Introduction to Statistical Modeling, Data Analytics in R, Data Manipulation, Applied Machine Learning, Natural Language Processing

Southeast University

B.Sc. in Computer Science & Engineering

September 2013 - December 2017 | Completed
Relevant Coursework
Database Design, Artificial Intelligence, Statistical Methods & Probability, Image Processing, Data Mining

Publications

SmolLab SEU at CheckThat! 2025: How well do multilingual transformers transfer across news domains for cross-lingual subjectivity detection

September 26, 2025 | CLEF - Conference and Labs of the Evaluation Forum

Research on multilingual transformer models and their cross-domain transfer capabilities for detecting subjectivity in news articles across different languages. This work explores the effectiveness of modern NLP architectures in handling cross-lingual subjectivity detection tasks.

Natural Language Processing, Multilingual Transformers, Cross-lingual Analysis

Professional Experience

Ludwig Pfeiffer Hoch- und Tiefbau GmbH & Co. KG

Programmer
January 2021 - December 2023
  • Led development, maintenance, and optimization of web applications and database management systems.
  • Created data-driven reports using Excel, Google Sheets, SQL, Pandas, and Power BI to support business decisions.
  • Developed innovative solutions including Pipeline Inspection Robots and QR code generators using data science techniques.
  • Built predictive models using Python, machine learning, and deep learning to enable automation.
  • Designed APIs using Django REST Framework and GraphQL for seamless front-end and back-end integration.
Skills
Data Science & Analytics
Python, Pandas, Machine Learning, Deep Learning
Data Visualization
Tableau, Power BI, Excel, Google Sheets
Web Development & Backend
Node.js, Express.js, Django REST Framework, GraphQL
Database Management
MongoDB, PostgreSQL
System & Cloud
Linux, Digital Ocean

Tappware Solutions Limited

Assistant Software Engineer
September 2019 - December 2020
  • Worked on data pipelines, word embedding, and analysis of complex datasets using advanced querying techniques.
  • Developed analytics and algorithms for e-governance and e-learning platforms.
  • Built predictive models using TensorFlow and Scikit-learn for real-world applications.
  • Designed and deployed machine learning REST APIs using Django REST Framework.
  • Analyzed Japanese healthcare data to provide insights for medical research and optimization.
Skills
Programming & Tools
Python, Flask, Linux
Data Science & Analytics
NumPy, Pandas, TensorFlow, Scikit-learn
Data Visualization
Seaborn, Matplotlib
Cloud & Database
PostgreSQL, SQLite, Heroku

Qtec Solution Limited

Junior Software Engineer
January 2018 - August 2019
  • Applied machine learning techniques including classification, regression, clustering, and ensemble methods.
  • Performed web crawling, data scraping, preprocessing, and API integration.
  • Developed sentiment analysis and data mining solutions for marketing intelligence.
  • Designed ETL pipelines and big data warehousing solutions.
  • Delivered business insights using MySQL, Excel, and visualization tools.
Skills
Data Science
Machine Learning, Deep Learning, Sentiment Analysis
Big Data
Hadoop, Apache Spark, ETL
Web & APIs
FastAPI, Scrapy, BeautifulSoup, Selenium

Grameen Intel Social Business Limited

Software QA Intern
August 2017 - December 2017
  • Designed and executed SQA test plans, test cases, and test scripts.
  • Tracked and documented bugs using Jira and TestLink.
  • Collaborated with developers to verify fixes and ensure software quality.
  • Performed cross-platform testing on Windows, Linux, and macOS.
Skills
SQA & Testing
Software Testing, Test Planning, Bug Tracking, Mobile Testing
Tools
Jira, TestLink

Projects

Netflix Movie Analysis

Music Sales Analysis

AI Survey Analysis

EV Charging Station Analysis

Smartphone Data Insights

Global AI Salary Dive

Dhaka Urban Population

Quality of Life Index 2024

Walmart Sales Analysis

Bangladesh Road Accidents

Student Information System

DASH VIEW

Region Wise Sales Dashboard

Interactive Sales Analytics Dashboard

Real-time visualization of regional sales performance with interactive filtering and drill-down capabilities.

Power BI, Excel, Business Intelligence

DYNAMIC REPORTS

Customer Insights and Trends Analysis

Behavioral patterns, segmentation, and trend discovery using advanced data visualization techniques.

Python, Seaborn

Customer Profitability and Marketing Analysis

Identifying high-value customers and evaluating marketing effectiveness through data-driven analysis.

R Markdown

Exploratory Data Analysis of Netflix Movies

A Hands-On Approach in R

R

Canada Immigration Insights

Visualizing Key Trends and Data

Python

SCRIPT VISION

Pandas, PostgreSQL, SQLAlchemy, Pgcli

PostgreSQL, Create table, insert values

Watch these interactive terminal recordings to see real-time demonstrations of database operations and Python scripting.

View More on Asciinema

Open Source Packages

Online Certifications

Machine Learning

Applied Machine Learning in Python

Natural Language Processing in TensorFlow

Python 101 for Data Science

Python Core

SQL for Data Science

Data Science Math Skills

Learn The Linux Command Line: Basic Commands

Data Science

Automate the Boring Stuff with Python Programming

Intro to Programming

Introduction to Data Science

Crash Course on Python

Convolutional Neural Networks in TensorFlow

Python for Beginners: Complete Python Programming

Machine Learning

Python (Basic)

Introduction to Data Analytics

Technical Skills

Programming Languages

Python, R, SQL, Java, C++, MATLAB, JavaScript

Database

MySQL, PostgreSQL, MongoDB, Oracle, SQLite, SQLAlchemy

Data Science Libraries & Tools

Pandas, NumPy, SciPy, Statsmodels

Machine Learning & Deep Learning Frameworks

Scikit-learn, XGBoost, LightGBM, CatBoost, TensorFlow, PyTorch, Keras, OpenCV

Natural Language Processing (NLP)

NLTK, spaCy, Hugging Face Transformers, Word Embeddings

Data Visualization & BI

Matplotlib, Seaborn, Plotly, ggplot2, Power BI, Tableau, Streamlit

Statistical Analysis & Mathematics

Hypothesis Testing, A/B Testing, Regression Analysis, Time Series, Experimental Design, Linear Algebra

Big Data & Cloud

Apache Spark, AWS (S3, EC2), Docker, Kubernetes, ETL, Data Warehousing

Web Development

HTML5, CSS3, JavaScript, Bootstrap, Django, Flask, FastAPI, Node.js, Express.js, Django REST, BeautifulSoup

DevOps & Deployment

Docker, Heroku, Netlify, CI/CD, Feature Engineering, Model Deployment, MLOps

Tools & Platforms

Git, GitHub, Jupyter, Linux, VS Code, Postman, Swagger

Microsoft Power Platform

Power BI, Power Apps, Power Automate, SharePoint

Methodologies

Agile, Scrum, CI/CD, Feature Engineering, Model Deployment, MLOps

Scripting & Automation

Bash, Scrapy, Selenium

IDEs & Text Editors

Neovim, Jupyter, VS Code

Design & Prototyping

Figma, GIMP

Markup Languages & Documentation

Markdown, LaTeX, JSON

Security & Operating Systems

Ubuntu, Kali Linux, Linux

Other Skills

LibreOffice, JWT, Pgcli, bpython

Blog

Technical Blog

Explore my technical blog featuring tutorials, data science insights, machine learning projects, and software engineering best practices.

Visit Blog

Get In Touch

Learn With Me

Explore my tutorials, technical content, and educational resources across multiple platforms.

Send Me a Message