Hi, I'm Jahid Hasan. As an experienced Data Scientist and ML Engineer with 6+ years of professional experience, I have a strong background in building machine learning models, scalable data pipelines, and production-ready ML systems. I am a Kaggle Grandmaster, placing me in the top 0.1% of data scientists globally, reaching that level in just 6 months by publishing over 20 highly ranked notebooks covering credit risk, healthcare, and clean energy. Currently completing my MS in Data Science at Eastern University with a 4.00 GPA, expected May 2026.
At Ludwig Pfeiffer, I built automated reporting pipelines that cut generation time by 60%, deployed production-grade REST APIs for ML models using Django REST Framework, and designed computer vision systems for automated pipeline inspection. I also built database management systems handling 100K+ records in MySQL and PostgreSQL and implemented Microsoft Power Apps and Power Automate solutions that improved process efficiency by 40%.
At TAPPWARE Solutions, I built e-governance data pipelines processing 500K+ records daily at 99.9% accuracy and deployed personalized recommendation models achieving 85%+ accuracy for a government e-learning platform. I worked with large-scale datasets, applied NLP and word embedding techniques, and built scalable REST APIs to serve model predictions in real time using Django REST Framework and PostgreSQL.
At Qtec Solutions, I developed and fine-tuned sentiment analysis models achieving 82% accuracy for marketing strategy optimization, built automated web scraping pipelines using Python and BeautifulSoup, and designed data warehousing solutions using dimensional modeling and ETL processes. I applied classification, regression, clustering, and ensemble methods to generate business insights from large-scale datasets.
Beyond my professional work, I published peer-reviewed research at CLEF 2025 on cross-lingual subjectivity detection using multilingual transformers. I write on Medium with 13 published articles on Python and data visualization, maintain an open source R and ggplot2 theme on CRAN with 1,200+ downloads, and am writing Data Science Mastery: From Fundamentals to Professional Practice, a 45 chapter book covering statistics, probability, machine learning, and real world applications.
I'm in for both research and development. Currently doing my graduate thesis work on Big Data Mining, Digital Image Processing, and Artificial Intelligence. I've listed some other topics even though they are out of my league. I hope to work on these in the future.
Research on multilingual transformer models and their cross-domain transfer capabilities for detecting subjectivity in news articles across different languages. This work explores the effectiveness of modern NLP architectures in handling cross-lingual subjectivity detection tasks.
Behavioral patterns, segmentation, and trend discovery using advanced data visualization techniques.
Identifying high-value customers and evaluating marketing effectiveness through data-driven analysis.
Watch these interactive terminal recordings to see real-time demonstrations of database operations and Python scripting
View More on Asciinema
A beautiful, minimal color theme for developers
A carefully crafted color scheme designed to reduce eye strain while maintaining excellent readability. Features warm, muted tones inspired by natural pine forests at dusk.
Explore my technical blog featuring tutorials, data science insights, machine learning projects, and software engineering best practices.
Visit BlogExplore my tutorials, technical content, and educational resources across multiple platforms