The RMS (Royal Mail Ship) Titanic was a British luxury passenger liner whose sinking is one of the most famous tragedies in modern history. En route from Southampton, England, to New York City, the ship struck an iceberg late on April 14, 1912, and sank in the early hours of April 15. Of the estimated 2,224 passengers and crew on board, only about 31% survived. The goal of this project is to show who survived this catastrophic event.
Introduction
The dataset is based on the Kaggle Titanic competition. The underlying information is unchanged, but the file has been modified to demonstrate different data cleaning techniques in Excel.
The Data
Cleaning The Data
The first step I took to clean the data was removing duplicates.
Then I created randomized 6-digit passenger ID numbers.
With IF formulas, I changed the survived, passenger class, and embarked location columns from codes to words, making them easier to read.
Next, I created four new columns for the passengers’ names using text functions such as TRIM, TEXTAFTER, and CONCAT.
Lastly, I created age groups to categorize each passenger as Baby, Child, Adult, or Elderly.
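For readers who think in code rather than spreadsheets, the cleaning steps above can be sketched in Python. This is a rough equivalent, not the Excel workbook itself. The column names (Survived, Pclass, Embarked, Name, Age) and the codes 0/1, 1/2/3, and C/Q/S follow the Kaggle file, but the label wording, the choice of four name columns (LastName, Title, FirstName, FullName), the age cutoffs, and the "Unknown" fallback are all assumptions made for illustration.

```python
import random

# Kaggle codes are real; the label wording is an assumption.
SURVIVED = {0: "Died", 1: "Survived"}
PCLASS = {1: "First class", 2: "Second class", 3: "Third class"}
EMBARKED = {"C": "Cherbourg", "Q": "Queenstown", "S": "Southampton"}


def trim(text):
    """Mimic Excel TRIM: drop leading/trailing spaces, collapse inner runs."""
    return " ".join(text.split())


def remove_duplicates(rows):
    """Keep the first copy of each fully identical row."""
    seen, unique = set(), []
    for row in rows:
        key = tuple(sorted(row.items()))
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def split_name(raw):
    """Split 'Last, Title. First' into four hypothetical name columns."""
    last, _, rest = raw.partition(",")
    title, _, first = rest.partition(".")
    last, title, first = trim(last), trim(title), trim(first)
    return {"LastName": last, "Title": title, "FirstName": first,
            "FullName": f"{first} {last}"}


def age_group(age):
    """Assumed cutoffs: under 2 Baby, under 18 Child, under 65 Adult."""
    if age is None:
        return "Unknown"  # the Kaggle file has missing ages
    if age < 2:
        return "Baby"
    if age < 18:
        return "Child"
    if age < 65:
        return "Adult"
    return "Elderly"


def clean(rows, seed=None):
    """Run the five cleaning steps in order and return new row dicts."""
    cleaned = [dict(row) for row in remove_duplicates(rows)]
    # Draw distinct random 6-digit IDs, one per remaining row.
    ids = random.Random(seed).sample(range(100000, 1000000), len(cleaned))
    for row, new_id in zip(cleaned, ids):
        row["PassengerId"] = new_id
        row["Survived"] = SURVIVED[row["Survived"]]
        row["Pclass"] = PCLASS[row["Pclass"]]
        row["Embarked"] = EMBARKED.get(row["Embarked"], "Unknown")
        row.update(split_name(row["Name"]))
        row["AgeGroup"] = age_group(row["Age"])
    return cleaned


# Illustrative rows in the Kaggle layout; the first two are exact duplicates.
sample = [
    {"Survived": 0, "Pclass": 3, "Name": "Braund,  Mr. Owen Harris ",
     "Age": 22, "Embarked": "S"},
    {"Survived": 0, "Pclass": 3, "Name": "Braund,  Mr. Owen Harris ",
     "Age": 22, "Embarked": "S"},
    {"Survived": 1, "Pclass": 3, "Name": "Heikkinen, Miss. Laina",
     "Age": 26, "Embarked": "S"},
]

cleaned = clean(sample, seed=1)
for row in cleaned:
    print(row["PassengerId"], row["FullName"], row["Survived"], row["AgeGroup"])
```

Sampling the IDs without replacement guarantees that no two passengers share a number, which a per-row random draw would not.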
Here is the final data
Titanic Dashboard
This is the dashboard I created from the final spreadsheet.