The Royal Mail Ship Titanic was a British luxury passenger liner that resulted in one of the most famous tragedies in modern history. En route to New York City from Southampton, England, the massive boat sank after colliding with an iceberg on April 14-15, 1912. Out of the estimated 2,224 passengers, only 31% of the passengers survived. The goal of this project is to show who survived this catastrophic event.

Introduction

The dataset is based on the Kaggle Titanic Competition. Without changing the information, it has been modified to show different data cleaning techniques with Excel.

The Data

Cleaning The Data

The first step I took to clean the data was removing duplicates

Then I created randomized 6-digit passenger ID numbers

With IF formulas, I changed the survived, passenger class, and embarked location columns to words making them easier to read

Next I created four new columns for the passengers’ names with multiple text formats such as TRIM, TEXTAFTER, and CONCAT

Lastly I created age groups to categorize each passenger as Baby, Child, Adult, or Elderly

Here is the final data

Titanic Dashboard

The dashboard I created from the final spreadsheet

Previous
Previous

NYC Motor Vehicle Collisions with Tableau

Next
Next

Shipping Ports with SQL