Lab Report: Organizing Data

Lab Summary

Connected to the world database, filtered the Australia and New Zealand region, ordered countries by population, summarized the region with GROUP BY and SUM(), generated a running total with SUM() OVER(), added ranking with RANK() OVER(), and solved the challenge by ranking every country inside its own region from largest to smallest population.

Filtered Region Query

Started with one region to make the ordering and grouping behavior easier to inspect.

Grouped and Windowed Results

Compared one-row grouped output against row-by-row running totals and ranks.

Challenge Ranking

Expanded the ranking logic to the full dataset and ordered the final output by region and rank.

Step-by-Step Walkthrough

The lab started with a filtered region and then gradually added grouping and ranking logic until the result set became analytical instead of descriptive only.

Connect and filter the Australia and New Zealand region

Opened the Command Host, switched to root, and connected to MariaDB with the standard lab credentials.
Verified that the world database was available and queried world.country for rows where Region = 'Australia and New Zealand'.
Ordered the filtered rows by Population DESC so the most populated country in the region appeared first.

Starting with one region makes it easier to understand what each grouping or ranking clause is doing before applying the same pattern to the full dataset.

Use GROUP BY, running totals, and rank inside one region

Applied GROUP BY Region together with SUM(Population) to calculate the total population of Australia and New Zealand as a single grouped result.
Used SUM(Population) OVER(partition by Region ORDER BY Population) to generate a running total while still keeping each country visible in the output.
Added RANK() OVER(partition by Region ORDER BY Population) to number the countries according to population within the same region.

GROUP BY summary and OVER ranking for Australia and New Zealand — Terminal: The output first shows the countries in Australia and New Zealand ordered by population, then the grouped regional total, then the running total and ranking generated with window functions.

Rank every country within its own region

Solved the challenge with RANK() OVER(partition by Region ORDER BY Population DESC) so each country received a rank relative to the other countries in its region.
Ordered the final result with ORDER BY Region, Ranked so the rows stayed grouped and easy to review.
Confirmed that the ranking restarted at 1 whenever the region changed, which is the expected behavior of the partition.

Challenge ranking countries by region — Terminal: The challenge query ranks countries from largest to smallest population inside each region, and the rank resets when the region changes.

Query Reference

Main clauses and window functions used to summarize and rank the country data.

sql

`GROUP BY Region`

Groups related rows so an aggregate function can return one summary result per region.

sql

`SUM(Population)`

Calculates the total population for the rows included in the current group or window.

sql

`SUM(Population) OVER(partition by Region ORDER BY Population)`

Returns a running total by region while still showing each original row.

sql

`RANK() OVER(partition by Region ORDER BY Population)`

Assigns a rank number within each region according to population order.

sql

`RANK() OVER(partition by Region ORDER BY Population DESC)`

Ranks the largest population as position 1 inside each region.

sql

`ORDER BY Region, Ranked`

Sorts the final challenge result so the rows remain grouped by region and rank.

Key Learnings

What Was Actually Learned

✓

GROUP BY summarizes related rows into one grouped result, which is useful for totals by category.

✓

Window functions with OVER() keep the original rows visible while still adding calculations such as running totals and rank numbers.

✓

RANK() becomes much more useful when it is partitioned by region, because the position resets correctly for each group.

Technical Conclusion

This lab showed the practical difference between aggregation and analytical windowing. A grouped query reduces many rows into one summary, while a windowed query keeps each row visible and enriches it with extra context.

The challenge made that distinction very clear. Ranking countries by region required preserving every row, so the OVER() clause was the right tool instead of collapsing the data with GROUP BY.