top of page
Search

Introduction

The field of Data Science continues to expand in depth and influence in 2025. With the rise of Artificial Intelligence (AI), machine learning, and big data technologies, the demand for skilled data professionals is greater than ever. However, the challenge most aspiring data scientists face is not the lack of information—it’s the lack of structure. Without a clear roadmap, it's easy to feel overwhelmed.

That’s why we at StatQuestJourney Hub have curated the ultimate Data Science Roadmap for 2025—a month-by-month, hands-on guide for mastering data science skills from beginner to advanced level in just 3+ months. And if you're looking to get started, our three-month intensive course for only Ksh 20,000 (with flexible installment options) is the perfect gateway.

Let’s dive into the roadmap! you can request the roadmap PDF by Contacting Shem here.




📅Master the Fundamentals

The first thing is all about building a strong foundation in programming, data manipulation, and basic statistics. Here's what to focus on:

🐍 Python Programming

Python remains the dominant language in data science. Begin with:

  • Data types, control flow, and functions

  • Libraries like Pandas and NumPy for data handling

  • Visualization using Matplotlib and Seaborn

  • Data cleaning techniques including SELECT/JOIN operations

📊 Power BI or Tableau

Learn how to build interactive dashboards and communicate insights effectively.

☁️ Cloud Basics

Get hands-on with:

  • AWS EC2 for computing

  • Amazon S3 for storage

📈 Basic Statistics

Understand the statistical underpinnings:

  • Probability distributions

  • Hypothesis testing

🤖 Generative AI Tools

Learn how to use:

  • ChatGPT

  • Claude to assist in research, code generation, and ideation.

🔍 Project

Complete a small but impactful project—such as a Sales Analysis Dashboard or simple Database Analysis.

📚 Recommended Read: Practical Statistics for Data Scientists by Peter & Andrew Bruce.

📅Machine Learning Fundamentals

This is the core of data science: prediction and pattern recognition.

🧠 Supervised Learning

  • Linear and Logistic Regression

  • Decision Trees & Random Forests

  • Use the Scikit-learn library extensively

🔍 Unsupervised Learning

  • Clustering (K-Means, DBSCAN)

  • Dimensionality Reduction (PCA)

🧠 Deep Learning Introduction

  • ANNs, CNNs, RNNs

  • Transformers for sequential and visual data

You’ll work with Kaggle datasets to practice your newly acquired skills.

📚 Recommended Reads:
  • Hands-On ML with Scikit-Learn, Keras & TensorFlow – Aurelien Geron

  • Machine Learning with PyTorch and Scikit-Learn – Sebastian Raschka et al.

📅Model Deployment and Monitoring

What’s the use of building models if no one can access them?

🐳 Deployment Stack

  • Docker & Kubernetes for containerization

  • FastAPI or Flask to expose models as REST APIs

  • GitHub for version control

📈 Monitoring

  • MLflow for experiment tracking

  • Prometheus and Grafana for system monitoring

📚 Book: Building ML Pipelines by Hannes Hapke & Catherine Nelson

📅Real-World Experience – Internship

Now that you have the skills, it’s time to apply them.

  • Search for internships via LinkedIn, Indeed, or participate in hackathons

  • Join communities and contribute to open-source projects

📚 Recommended Book: Data Science Handbook by Carl Henry Wang, William Chen, and Max Song

📅Pick a Specialization (NLP or CV)

At this point, specialization is essential. Pick a track that excites you.

📝 Natural Language Processing (NLP)

  • Named Entity Recognition, Summarization, Topic Modeling

  • Vectorization methods: TF-IDF, Word2Vec, GloVe

  • Transformers & Attention Mechanisms

📚 Books:
  • NLP in Action by Hobson Lane et al.

  • NLP with Transformers by Lewis Tunstall et al.

🖼️ Computer Vision (CV)

  • Object detection, segmentation

  • Real-time projects: customer queue detection, surveillance, etc.

  • Use TensorFlow and PyTorch

📚 Book: Deep Learning and Vision Systems by Mohamed Elgendy

📅Cutting-Edge Tech – LLMs & Diffusion Models

🔠 For NLP Track:

  • Architectures like GPT-4, LLAMA, T5

  • Tasks: Text summarization, chatbots, RAG systems

  • Learn LoRA/QLoRA for efficient fine-tuning

📚 Book: Quick Start Guide to LLMs by Sinan Ozdemir

🎨 For CV Track:

  • Learn about Noise Scheduling, Reverse Processes

  • Work on Image Generation, Inpainting, Style Transfer

  • Tools: Stable Diffusion, DreamBooth

📚 Book: Hands-on Generative AI with Transformers & Diffusion Models by Omar Sanseviero et al.

📚 Additional Topics (Not to Miss!)

  • R Programming: Great for statistical modeling and research

  • Git: Essential for version control and team collaboration

  • Data Structures & Algorithms: For logical problem-solving and technical interviews

  • SQL Variants: MySQL, PostgreSQL, SQL Server

  • Big Data Tools (optional): Apache Spark, Hadoop

  • Advanced Visualizations: Plotly, Dash

🌟 Why Choose StatQuestJourney Hub?

At StatQuestJourney Hub, we don’t just teach you skills—we prepare you for a career.

✅ Our Unique Value:

  • Well-aligned curriculum based on industry trends (just like the roadmap above)

  • Friendly and hands-on learning environment

  • Real-world projects, mentorship, and community engagement

  • Affordable course fee: Ksh 20,000 for 3 months

  • Flexible payment in 2, 3 or 4 installments

🎯 Who Is This For?

  • Students and fresh graduates looking to upskill

  • Professionals transitioning into data science

  • Freelancers who want to gain an edge

🚀 Enroll Now – Start Your Data Science Journey Today

The Data Science world waits for no one. If you want to stand out in the field, you need the right skills, structure, and support—and that’s exactly what StatQuestJourney Hub offers.

➡️ Course Fee: Ksh 20,000➡️ Duration: 3 months (Open & Private Sessions)➡️ Payment Flexibility: 2, 3, or 4 installments➡️ Contact: statquestjourney@gmail.com | +254768944928

Whether you’re just starting or looking to specialize, there’s no better time than now to future-proof your career in data science.



The SQL WHERE clause allows to filtering of records in queries. Whether you’re retrieving data, updating records, or deleting entries from a database, the WHERE clause plays an important role in defining which rows will be affected by the query. Without it, SQL queries would return all rows in a table, making it difficult to target specific data.

In this article, we will learn the WHERE clause in detail—from basic concepts to advanced ones. We’ll cover practical examples, discuss common operators, provide optimization tips, and address real-world use cases.

What is the SQL WHERE Clause?

The SQL WHERE clause is used to specify a condition while fetching or modifying data in a database. It filters the rows that are affected by the SELECT, UPDATE, DELETE, or INSERT operations. The condition can range from simple comparisons to complex expressions, enabling precise targeting of the data.

Syntax:

SELECT column1,column2 FROM table_name WHERE column_name
operator value;

Parameter Explanation:

  1. column1,column2: fields in the table

  2. table_name: name of table

  3. column_name: name of field used for filtering the data

  4. operator: operation to be considered for filtering

  5. value: exact value or pattern to get related data in the result 

Examples of WHERE Clause in SQL

We will create a basic employee table structure in SQL for performing all the where clause operation.

Query:

CREATE TABLE Emp1(    EmpID INT PRIMARY KEY,    Name VARCHAR(50),    Country VARCHAR(50),    Age int(2),  mob int(10));-- Insert some sample data into the Customers tableINSERT INTO Emp1 (EmpID, Name,Country, Age, mob)VALUES (1, 'Shubham',  'India','23','738479734'),       (2, 'Aman ',  'Australia','21','436789555'),       (3, 'Naveen', 'Sri lanka','24','34873847'),       (4, 'Aditya',  'Austria','21','328440934'),       (5, 'Nishant', 'Spain','22','73248679');

Output

Example 1: Where Clause with Logical Operators

To fetch records of  Employee with age equal to 24. 

Query:

SELECT * FROM Emp1 WHERE Age=24;

Output:

To fetch the EmpID, Name and Country of Employees with Age greater than 21. 

Query:

SELECT EmpID, Name, Country FROM Emp1 WHERE Age > 21;

Output:

Example 2: Where Clause with BETWEEN Operator

It is used to fetch filtered data in a given range inclusive of two values. 

Syntax: 

SELECT column1,column2 FROM table_name  WHERE column_name BETWEEN value1 AND value2;

Parameter Explanation:

  1. BETWEEN: operator name 

  2. value1 AND value2: exact value from value1 to value2 to get related data in result set.  

To fetch records of Employees where Age is between 22 and 24 (inclusive).

Query:

SELECT * FROM Emp1 WHERE Age BETWEEN 22 AND 24;

Output:

Example 3: Where Clause with LIKE Operator

It is used to fetch filtered data by searching for a particular pattern in the where clause. 

Syntax: 

SELECT column1,column2 FROM table_name WHERE column_name LIKE pattern;

Parameters Explanation:

  1. LIKE: operator name 

  2. pattern: exact value extracted from the pattern to get related data in the result set. 

Note: The character(s) in the pattern is case-insensitive.

To fetch records of Employees where Name starts with the letter S.

Query:

SELECT * FROM Emp1 WHERE Name LIKE 'S%'; 

The ‘%'(wildcard) signifies the later characters here which can be of any length and value. 

Output:

To fetch records of Employees where Name contains the pattern ‘M’.

Query:

SELECT * FROM Emp1 WHERE Name LIKE '%M%';

Output:

Example 4: Where Clause with IN Operator

It is used to fetch the filtered data same as fetched by ‘=’ operator just the difference is that here we can specify multiple values for which we can get the result set.

Syntax: 

SELECT column1,column2 FROM table_name WHERE column_name IN (value1,value2,..);

Parameters Explanation:

  1. IN: operator name 

  2. value1,value2,..: exact value matching the values given and get related data in the result set.

To fetch the Names of Employees where Age is 21 or 23.

Query:

SELECT Name FROM Emp1 WHERE Age IN (21,23);

Output:


List of Operators that Can be Used with WHERE Clause

Conclusion

The WHERE clause is use for filtering and refining SQL queries. Whether you’re working with basic conditions, using logical operators, or performing advanced queries with subqueries and EXISTS, mastering the WHERE clause is essential for every SQL user. Understanding how to efficiently filter data, avoid common pitfalls, and optimize your queries will ensure you’re able to write clean, fast, and accurate SQL queries.

SQL joins are the foundation of database management systems, enabling the combination of data from multiple tables based on relationships between columns. Joins allow efficient data retrieval, which is essential for generating meaningful observations and solving complex business queries.

Understanding SQL join types, such as INNER JOIN, LEFT JOIN, RIGHT JOIN, FULL JOIN, and NATURAL JOIN, is critical for working with relational databases.

In this article, we will cover the different types of SQL joins, including INNER JOIN, LEFT OUTER JOIN, RIGHT JOIN, FULL JOIN, and NATURAL JOIN. Each join type will be explained with examples, syntax, and practical use cases to help us understand when and how to use these joins effectively.

What is SQL Join?

SQL JOIN clause is used to query and access data from multiple tables by establishing logical relationships between them. It can access data from multiple tables simultaneously using common key values shared across different tables. We can use SQL JOIN with multiple tables. It can also be paired with other clauses, the most popular use will be using JOIN with WHERE clause to filter data retrieval.

Example of SQL JOINS

Consider the two tables, Student and StudentCourse, which share a common column ROLL_NO. Using SQL JOINS, we can combine data from these tables based on their relationship, allowing us to retrieve meaningful information like student details along with their enrolled courses

Student Table

StudentCourse Table

Both these tables are connected by one common key (column) i.e ROLL_NO. We can perform a JOIN operation using the given SQL query:

Query:

SELECT s.roll_no, s.name, s.address, s.phone, s.age, sc.course_idFROM Student sJOIN StudentCourse sc ON s.roll_no = sc.roll_no;

Output

Types of JOIN in SQL

There are many types of Joins in SQL. Depending on the use case, we can use different type of SQL JOIN clause. Below, we explain the most commonly used join types with syntax and examples:

  • INNER JOIN

  • LEFT JOIN

  • RIGHT JOIN

  • FULL JOIN

  • Natural Join

    Key Terms

    • table1: First table.

    • table2: Second table

    • matching_column: Column common to both the tables.

    Note: We can also use RIGHT OUTER JOIN instead of RIGHT JOIN, both are the same.

  • RIGHT JOIN Example

    In this example, the RIGHT JOIN retrieves all rows from the StudentCourse table and the matching rows from the Student table based on the ROLL_NO column.

    Query:

    SELECT Student.NAME,StudentCourse.COURSE_ID FROM StudentRIGHT JOIN StudentCourse ON StudentCourse.ROLL_NO = Student.ROLL_NO;

OUTPUT

4. SQL FULL JOIN

FULL JOIN creates the result-set by combining results of both LEFT JOIN and RIGHT JOIN. The result-set will contain all the rows from both tables. For the rows for which there is no matching, the result-set will contain NULL values.

SYNTAX
SELECT table1.column1,table1.column2,table2.column1,....FROM table1 FULL JOIN table2ON table1.matching_column = table2.matching_column;

Key Terms

  • table1: First table.

  • table2: Second table

  • matching_column: Column common to both the tables.

FULL JOIN Example

This example demonstrates the use of a FULL JOIN, which combines the results of both LEFT JOIN and RIGHT JOIN. The query retrieves all rows from the Student and StudentCourse tables. If a record in one table does not have a matching record in the other table, the result set will include that record with NULL values for the missing fields

Query:

SELECT Student.NAME,StudentCourse.COURSE_ID FROM StudentFULL JOIN StudentCourse ON StudentCourse.ROLL_NO = Student.ROLL_NO;

5. SQL Natural Join (?)

Natural join can join tables based on the common columns in the tables being joined. A natural join returns all rows by matching values in common columns having same name and data type of columns and that column should be present in both tables.

  • Both table must have at least one common column with same column name and same data type.

  • The two table are joined using Cross join.

  • DBMS will look for a common column with same name and data type. Tuples having exactly same values in common columns are kept in result.

Natural join Example

Look at the two tables below- Employee and Department


Problem: Find all Employees and their respective departments.

Solution Query: (Employee) ? (Department)

Conclusion

SQL joins are essential tools for anyone working with relational databases. Understanding the different types of joins in SQL, like INNER JOIN, LEFT OUTER JOIN, RIGHT JOIN, and FULL JOIN, allows us to combine and query data effectively. With the examples and syntax covered here, we should feel confident applying these SQL join types to our data to retrieve meaningful observations and manage complex queries with ease. Use these SQL join techniques to streamline our data handling and enhance our SQL skills.


FAQs

What are the 4 types of join SQL?

In SQL, the four main types of joins are: INNER JOINLEFT JOINRIGHT JOINFULL JOIN

What is a join in SQL?

A join in SQL is a relational database operation used to combine rows from two or more tables based on a related column between them.

What is the difference between INNER JOIN and LEFT JOIN?

The main difference between INNER JOIN and LEFT JOIN lies in how they handle unmatched rows. INNER JOIN focuses on matched rows only, while LEFT JOIN includes all rows from the left table, with NULLs where there is no match in the right table.

SQJH LOGO.png

Data Unlocked: Learn. Explore. Transform.

Phone

+254 702 304599

+254 768 944928

Email

Location

Nairobi, Kenya

StatQuestJourney Hub is a global online learning platform founded in 2022. Our mission is to empower students and professionals by providing top-tier education in data science, statistics, and programming, making you the driving force behind your career growth and success.

  • TikTok
  • Telegram
  • Facebook
  • X
  • LinkedIn
  • Instagram
  • Youtube

Get latest news delivered directly to your inbox

Thanks for submitting!

©2024 by StatQuestJourney Hub

bottom of page