Data Specialist · Open to Work

Sukhkirandeep
Kaur Sidhu, Ph.D.

Professor · Data Analyst · Database Specialist

Computer science researcher with a Ph.D. and postdoctoral experience building machine learning models, data pipelines, and BI dashboards. Passionate about turning complex data into decisions.

Ph.D.Computer Science
20+Research Papers
8+Portfolio Projects
15+Tools & Technologies
CABased in Canada
About

Background &
Experience

I am a computer science researcher with a Ph.D. in Computer Science and strong hands-on experience in SQL, Python, and data analytics. My background bridges academia and applied data work — I have taught programming and database systems at Canadian post-secondary institutions, giving me a deep understanding of both the theory and practice of data.

During my postdoctoral research, I worked on machine learning models for image data analysis and developed graphical user interfaces for data processing pipelines. I enjoy building end-to-end data solutions — from raw ETL pipelines and data warehouses to interactive dashboards and predictive models — that support real, data-driven decision making.

Currently seeking roles in Data Analysis, Data Science, or Database Development where I can apply my research depth and hands-on project experience.

LocationBrampton, Canada · Open to Remote
Projects

Featured
Work

01
Azure Data Factory Retail ETL Pipeline
Built an end-to-end Azure Data Factory ETL pipeline to move Mockaroo-generated retail CSV data from Azure Blob Storage into Azure SQL Database. The pipeline loads data into staging tables, runs a SQL stored procedure to transform the data, and validates the final reporting table using SQL queries.
Azure Data FactoryAzure Blob StorageAzure SQL DatabaseSQLMockarooETL
→ View on GitHub
02
SQL Server to Azure SQL Database Migration Project
Migrated a local SQL Server retail database to Azure SQL Database using Python and pyodbc. Created SQL tables, stored procedures, validation queries, and data-quality checks to verify successful migration.
SQL ServerAzure SQL DatabasePythonSQLETL
→ View on GitHub
03
CRM Sales Data Lakehouse using Databricks
Built an end-to-end CRM sales analytics lakehouse using Databricks, PySpark, Delta Lake, and SQL. Designed Bronze, Silver, and Gold data layers to process customer, sales, product, employee, and support ticket data. Created Gold analytics tables and a Databricks dashboard to analyze revenue, customer value, product performance, regional sales, and support ticket trends.
DatabricksPySparkDelta LakeSQLETLData ENgineeringDashboard
→ View on GitHub
04
Canadian Data Analyst Job Market Analysis
Built a data analysis and automation project to study the Canadian data analyst job market. I collected job postings from public APIs, cleaned and analyzed the data using Python and Pandas, visualized insights in Tableau, and later extended the project with an n8n workflow to automate weekly job collection, duplicate removal, Google Sheets updates, and new job alerts.
PythonWeb ScrapingPandasTableauREST APINLPn8nJavascriptAutomation
📊 Live Dashboard → GitHub
05
Energy Production Data Warehouse & Analytics Dashboard
Designed and implemented an end-to-end SQL Server data warehouse to track and analyze asset downtime across energy production facilities. Built a fully automated ETL pipeline to ingest, clean, and transform raw operational data into structured fact and dimension tables. Developed an interactive Power BI dashboard that reduced manual reporting time by ~60% and enabled teams to identify high-downtime assets and pinpoint failure trends.
SQL ServerETLPower BIData Warehousing
→ View on GitHub
06
Hospital Admission Analysis
Analyzed 200,000+ New York hospital admission records to uncover patterns in patient volume, length of stay, and resource utilization across departments. Designed an interactive Tableau dashboard with drill-down filters by hospital, diagnosis, and time period. Key findings revealed significant weekend admission spikes and seasonal trends to inform staffing and bed allocation strategies.
TableauHealthcare AnalyticsEDA
→ View on GitHub
07
Fake Job Classifier
Built a machine learning classifier to detect fraudulent job postings, trained on 17,000+ job listings. Engineered features using TF-IDF and trained a Logistic Regression model with SMOTE to handle class imbalance, achieving AUC = 0.96. Deployed as a fully interactive Streamlit web application — paste any job description and get an instant real/fake prediction.
PythonNLPScikit-learnStreamlit
🚀 Live App → GitHub
08
Employees Sales Performance Insights
Developed a complete BI pipeline simulating a realistic multi-channel sales environment. Used PL/SQL stored procedures to automate data aggregation, calculate KPIs, and generate ranked employee summaries across regions and product lines. Delivered a Tableau dashboard giving sales managers a clear view of top performers, underperforming channels, and revenue distribution.
TableauPL/SQLBusiness IntelligenceData Aggregation
→ View on GitHub
09
Crime Severity Index in Canadian Provinces
Conducted a multi-year analysis of Statistics Canada's Crime Severity Index across all Canadian provinces from 2019–2023, with a focused 2022 vs. 2023 year-over-year comparison. Built a Tableau visualization suite featuring choropleth maps and trend lines that communicate regional disparities — useful for policy researchers, journalists, and public safety analysts.
TableauPublic DataData StorytellingCanadian Stats
→ View on GitHub
Research

Academic
Contributions

Publications
20+ Peer-Reviewed Papers
Authored and co-authored over 20 research papers published in reputable academic journals and international conferences. Research areas span wireless sensor networks, machine learning, computer vision, network security, and data analysis.
JournalsConferencesPeer-Reviewed
Postdoctoral Research
Forensic ML at West Virginia University
Worked on a CSAFE research grant at the Department of Forensic and Investigative Science. Developed a deep learning model using semantic segmentation to automatically extract breech face areas from scanned cartridge case images. Built a PyQT5 GUI and web-based interface to streamline workflows for forensic firearm examiners.
Deep LearningComputer VisionPyTorchPyQT5
Research Grant
Cybersecurity Forensics — Checkpoint Technologies
Secured competitive research funding from Checkpoint Software Technologies Pvt. Ltd. as Principal Investigator. Designed a framework for BYOD Cyber Security Forensic Readiness Infrastructure, combining network forensics and security policy design.
CybersecurityBYODForensicsPI-Funded
Ph.D. Supervision
Mentored Researchers at LPU
Supervised two Ph.D. dissertations and guided students in developing independent research projects, collectively resulting in 9 published papers. Research topics spanned machine learning, networking, and cybersecurity.
Ph.D. SupervisionMentorship9 Student Papers
Education

Academic
Background

2018
Ph.D.
Computer Science & Engineering
National Institute of Technology, Srinagar, India
Dissertation: "Improving Network Lifetime of Wireless Sensor Networks using Load Balancing Techniques"
Coursework: Wireless Sensor Networks · Data Mining & Big Data Analytics · Statistics · Distributed Systems · Research Methodology
2012
M.Tech
Masters in Technology, Computer Science & Engineering
Lovely Professional University, India
Thesis: "Image Segmentation using Bacteria Foraging Optimization"  ·  70.29%
2010
B.Tech
Bachelors in Technology, Computer Science & Engineering
Punjabi University, India
71%
Expertise

Skills &
Technologies

Languages
Python
SQL / PL/SQL
R
C / C++
Databases
SQL Server
PostgreSQL
Oracle / PL/SQL
MongoDB (NoSQL)
Visualization & BI
Tableau
Power BI
Matplotlib
Streamlit
Machine Learning & AI
Scikit-learn
TensorFlow / PyTorch
LightGBM / NLP
OpenCV / Computer Vision
Data Engineering
ETL Pipelines
Data Warehousing
Pandas / NumPy
Advanced Statistics
Cloud & Tools
Azure
Git / GitHub
Flask / PyQT5
Jupyter Notebooks
EnglishFull Professional
PunjabiNative / Bilingual
HindiNative / Bilingual
Contact

Get in
Touch

I'm currently open to data analyst, data science, and database roles in Canada. Feel free to reach out if you'd like to connect or discuss an opportunity.