Apply your EDA skills to analyze real-world e-commerce data, uncover patterns and insights using univariate, bivariate, and multivariate techniques, and create compelling visualizations that tell a data story.
In this comprehensive assignment, you will work as a Data Analyst at TechMetrics Inc., a growing e-commerce analytics company. Your manager needs you to perform exploratory data analysis on customer, sales, and website data to understand business performance and identify opportunities for growth.
"Welcome to TechMetrics Inc.! We have been collecting data from our e-commerce platform for the past year, and our executive team is preparing for the quarterly business review. We need a comprehensive EDA report to understand our business better.
I need you to investigate the following areas:
The board presentation is in two weeks. I need you to create a thorough EDA report with clear visualizations and actionable insights. Focus on telling the data story - what patterns emerge, what surprises you, and what should we do about it.
Looking forward to your analysis!"
- Jennifer Martinez, VP of Analytics
Download the three datasets below. These contain real business data from TechMetrics Inc.
Customer profiles including demographics, account details, and purchase history metrics.
customer_id, signup_date, age, gender, regionmembership_type, total_purchases, total_spent, avg_order_valuedays_since_last_purchase, satisfaction_score, referral_countTransaction records with order details, products, quantities, and revenue data.
order_id, customer_id, order_date, product_categoryproduct_name, quantity, unit_price, total_amountdiscount_applied, payment_method, shipping_regionWebsite analytics including page views, session duration, bounce rates, and conversion data.
session_id, customer_id, visit_date, traffic_sourcedevice_type, pages_viewed, session_duration, bounceconverted, cart_value, product_viewsdf.info() and df.describe()univariate_distributions.pngbivariate_relationships.pngcategorical_comparisons.pngcorrelation_heatmap.pngtemporal_patterns.pngkey_insights_dashboard.png| Component | Points | Criteria |
|---|---|---|
| Data Overview | 10 | Complete data inspection, quality assessment, proper merging |
| Univariate Analysis | 20 | Thorough distribution analysis, appropriate statistics, quality visualizations |
| Bivariate Analysis | 25 | All three relationship types analyzed, proper techniques, clear interpretations |
| Correlation Analysis | 15 | Complete matrix, professional heatmap, meaningful interpretation |
| Temporal Analysis | 15 | Time series patterns identified, trends visualized, insights generated |
| Business Insights | 15 | Clear insights, actionable recommendations, well-written report |
| Total | 100 |
Create a public GitHub repository with the exact name shown below, add all required files, and submit through the submission portal.
github.com/<your-username>/techmetrics-eda
techmetrics-eda/
├── eda_analysis.ipynb
├── eda_report.pdf
├── visualizations/
│ ├── univariate_distributions.png
│ ├── bivariate_relationships.png
│ ├── categorical_comparisons.png
│ ├── correlation_heatmap.png
│ ├── temporal_patterns.png
│ └── key_insights_dashboard.png
├── data/
│ └── (downloaded CSV files)
└── README.md
All files are required. Submission will fail if any file is missing.