
Study Note/Python (25)

Join, Merge, Concat, Append, and Pivot table
1. Using the join Method
The join method in pandas is used to combine two dataframes based on their index. Here is the syntax: DataFrame.join(other, on=None, how='left', lsuffix='', rsuffix='', sort=False)
other: The dataframe to join with.
on: Column or index level names to join on. Default is the index.
how: Type of join to perform ('left', 'right', 'outer', 'inner'). Default is 'left'.
lsuffix: S..
2024. 5. 23.
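As a minimal sketch of the join parameters described above, the snippet below joins two small dataframes on their index. The dataframes and column names (`name`, `score`, `value`) are made up for illustration and are not from the original post.

```python
import pandas as pd

# Hypothetical example data: two dataframes that share some index labels.
left = pd.DataFrame({"name": ["Ann", "Bob", "Cara"]}, index=[1, 2, 3])
right = pd.DataFrame({"score": [90, 85]}, index=[2, 3])

# Default how='left': keep every row of `left`; rows with no match get NaN.
left_joined = left.join(right)

# how='inner': keep only the index labels present in both dataframes.
inner_joined = left.join(right, how="inner")

# When both dataframes have a column with the same name,
# lsuffix/rsuffix are needed to tell the two columns apart.
a = pd.DataFrame({"value": [1, 2]}, index=["x", "y"])
b = pd.DataFrame({"value": [10, 20]}, index=["x", "y"])
suffixed = a.join(b, lsuffix="_left", rsuffix="_right")

print(left_joined)
print(inner_joined)
print(suffixed)
```

With the default left join, index 1 has no partner in `right`, so its `score` is missing; the inner join drops that row instead.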
The difference between deep learning and machine learning
Artificial Intelligence: Human capabilities and functions -> Computer
Machine Learning: Learns patterns, rules, and characteristics on its own using large amounts of data. (Data-dependent: if the data is wrong, it will fail.) Advantage: just provide the data, because it learns by itself, so the data must be very well refined.
Deep Learning: D..
2024. 5. 21.
Seaborn plot (Scatter, Hist, and Box)
Today, I plan to practice again by using Seaborn to recreate the plots I made last time. Seaborn is a Python package for data visualization, built on top of Matplotlib. It provides a high-level interface for creating attractive and informative statistical graphics. Seaborn is used to visualize statistical data, particularly to explore relationships between variables. It offers a wide range of pl..
2024. 5. 16.
Library (Pandas, Seaborn, Matplotlib, Numpy) - Iris Data
When using Python, in addition to the basic functions, we need to use various libraries. In particular, Pandas, Seaborn, Matplotlib, and Numpy are the most essential libraries for data analysis.
1. Pandas:
  - Purpose: Pandas is used for data manipulation and analysis. It provides data structures to efficiently store and manipulate data. Its main data structures are Series (a one-dimen..
2024. 5. 14.
Enumerate()
The `enumerate()` function takes an iterable object (such as a list, tuple, or string) as input and is used to iterate over it while keeping track of the index and the corresponding element. It's commonly used in for loops when you need to access both the index and the element on each iteration. The syntax of the `enumerate()` function is as follows: enumerate(iterable, start=0) - `iterable`: The..
2024. 5. 8.
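Here is a small sketch of `enumerate(iterable, start=0)` in a for loop. The `fruits` list is a made-up example, not taken from the original post.

```python
# Hypothetical example list.
fruits = ["apple", "banana", "cherry"]

# Default start=0: each item is paired with its zero-based index.
pairs = list(enumerate(fruits))

# start=1 is handy for human-friendly numbering.
numbered = []
for position, fruit in enumerate(fruits, start=1):
    numbered.append(f"{position}. {fruit}")

print(pairs)
print(numbered)
```

Using `enumerate()` avoids keeping a separate counter variable or writing `range(len(fruits))` just to get the index.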
Missing data handling
In the previous post, I solved many different types of Python questions on the Titanic dataset. Today, I want to look at how to calculate the average age, which was the 3rd question in the previous post. Since the 'Age' column contains null values, I will consider how to handle these missing values. The most commonly used methods for handling missing data in practice are as follows: Mean Impu..
2024. 5. 2.
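As a rough sketch of filling missing ages with the column mean, the snippet below uses a tiny made-up dataframe rather than the real Titanic data. The `Age` column name follows the excerpt, while the values and the `Age_filled` column are assumptions for illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for the Titanic 'Age' column, with two missing values.
df = pd.DataFrame({"Age": [22.0, np.nan, 30.0, np.nan, 38.0]})

# Series.mean() skips NaN by default, so only the three known ages are averaged.
mean_age = df["Age"].mean()

# Fill the missing values with that mean, keeping the original column for comparison.
df["Age_filled"] = df["Age"].fillna(mean_age)

print(mean_age)
print(df)
```

Because the mean is computed before filling, the average of the filled column is the same as the average of the known values.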