본문 바로가기

Study Note54

Missing data handling In the previous post, I solved many different type of python questions on the Titanic dataset.Today, I want to analyze the way of calculating the average age, which was the 3rd question in the previous post.Since the 'Age' column contains null values, I will consider how to handle these missing values. The most commonly used methods for handling missing data in practice are as follows:Mean Impu.. 2024. 5. 2.
Union & Union All UNION: The `UNION` operator in SQL is used to combine the results of two or more SELECT statements into a single result set. It removes duplicate rows from the result set, so each row appears only once.UNION ALL: On the other hand, the `UNION ALL` operator also combines the results of two or more SELECT statements into a single result set, but it includes all rows from all SELECT statements, inc.. 2024. 5. 1.
Taitanic data analysis Today, I had a personal assignment. It involved coding questions related to passengers using Titanic data.The data was from kaggle. https://www.kaggle.com/competitions/titanic/data?select=train.csv Titanic - Machine Learning from Disaster | Kaggle www.kaggle.com    Data file  I wrote Python code based on this data. It's still quite lacking, and the way I approach coding is very inefficient and c.. 2024. 4. 30.
Mece logic tree MECE(Logical) stands for 'Mutually Exclusive, Collectively Exhaustive,' representing a principle of clear and systematic categorization in problem-solving and analysis. It is a guiding principle widely applied in various fields, particularly in management consulting and data analysis.MECE Logit Tree is a methodology used in predictive or classification modeling, where data is analyzed and specif.. 2024. 4. 29.
Python Data Types Python Data Types While I was studying Python, I realized that there are lots of different data types in Python.    Numerical data types are used to represent numbers. They primarily consist of integers and floating-point numbers, used for calculations and numerical operations.  In Python, two main types of numerical data are commonly used:1. Integer:   - Integers represent whole numbers without.. 2024. 4. 25.
AARRR Funnel Analysis AARRR is a model used in digital marketing and product management to analyze and optimize the customer lifecycle.  The AARRR model, shaped like a funnel narrowing downwards, aims to reduce customer running out rates as you move down the funnel.      AARRR stands for:Acquisition: This stage focuses on acquiring new users or customers. It involves activities such as driving traffic.. 2024. 4. 24.