[Python][pandas] Exploring pandas in Depth

개발 Code/파이썬 Python

[Python][pandas] Exploring pandas in Depth

5hr1rnp 2025. 2. 11. 17:31

What is Pandas?

Pandas is an open-source Python library designed for data manipulation and analysis. It was developed by Wes McKinney in 2008 when he saw the need for an efficient and intuitive tool to handle financial data. The name "Pandas" originates from "PANel DAta," reflecting its focus on handling multidimensional data structures.

Built on top of NumPy, Pandas provides a powerful and flexible framework for working with structured data. Whether you're a beginner or an experienced data scientist, Pandas offers essential tools to process and analyze data efficiently.

It provides two primary data structures:

Series: A one-dimensional labeled array that can store any data type.
DataFrame: A two-dimensional, labeled, and resizable data structure similar to spreadsheets or SQL tables.

Pandas is widely used in data science due to its speed, flexibility, and expressive data structures. It has become a fundamental tool for real-world data analysis, enabling advanced data manipulation. As one of the most powerful open-source data analysis libraries, Pandas continues to evolve and is widely used across multiple programming languages.

Key Features of Pandas

1. Handling Missing Data

Easily manage missing values such as NaN, NA, or NaT.

2. Resizable Data Structures

Insert or delete columns in DataFrames and higher-dimensional objects.

3. Automatic Data Alignment

Aligns labels explicitly or automatically.

4. Flexible Data Grouping

Perform aggregation and transformation operations by grouping data.

5. Extensive Data Transformation

Convert Python and NumPy data structures into DataFrame objects effortlessly.

6. Slicing and Indexing

Supports slicing, fancy indexing, and subsetting of large datasets.

7. Merging and Joining

Intuitively merge and join datasets.

8. Reshaping Data

Supports reshaping and pivoting datasets.

9. Hierarchical Labeling

Assign multiple labels to axes.

10. Powerful I/O Tools

Load and store data in various formats such as CSV, Excel, and databases.

11. Time Series Support

Generate date ranges, calculate moving averages, and transform time series data.

728x90

Getting Started with Pandas

Pandas can be installed using pip or conda:

# PyPI
pip install pandas

# conda
conda install -c conda-forge pandas

Basic Examples

Creating a Series

A Pandas Series is a one-dimensional labeled array that can store any data type, such as integers, strings, or Python objects.

# pandas series
import pandas as pd 

data = [10, 20, 30, 40]
series = pd.Series(data, index=['a', 'b', 'c', 'd'])
print(series)

# output
# a    10
# b    20
# c    30
# d    40
# dtype: int64

Creating a DataFrame

A DataFrame is a two-dimensional structure consisting of rows and columns, similar to an Excel or SQL table.

# pandas dataframe
import pandas as pd

data = {
    "Name": ["Kim Seoul", "Lee Jeonju", "Song Gongju"],
    "Age": [25, 30, 35],
    "City": ["Seoul", "Jeonju", "Gongju"]
}
df = pd.DataFrame(data)
print(df)

# output
#	Name  	Age   City
# 0  Kim Seoul   25  Seoul
# 1  Lee Jeonju  30  Jeonju
# 2  Song Gongju 35  Gongju

When to Use Pandas?

Pandas is ideal for the following tasks:

Data Cleaning & Preprocessing
Exploratory Data Analysis (EDA)
Working with Time Series or Structured Data

However, for handling large-scale data, Dask or PySpark may be more suitable.

Pandas remains one of the most essential libraries in the data science ecosystem, providing efficient tools for working with structured data.

저작자표시 비영리 변경금지 (새창열림)

'개발 Code > 파이썬 Python' 카테고리의 다른 글

[Python][program] CLI ASCII art 발렌타인 메세지 쓰기 (0)	2025.02.12
[Python][pandas] Loading Data - CSV (0)	2025.02.11
[Python][numpy] Numpy로 효율적인 데이터 샘플링 및 난수 생성 (0)	2025.02.09
[Python][numpy] Numpy 배열 저장 및 불러오기 (0)	2025.02.09
[Python][numpy] Numpy 기초부터 활용까지 (0)	2025.02.08

현재글[Python][pandas] Exploring pandas in Depth

🐶짱구와 꾜미 집에 놀러온 용뇽이🦊

일상 속에서 발견한 작은 언어의 재미, 스쳐 지나간 풍경과 맛있는 기억들, 그리고 배움 속에서 얻은 깨달음을 나누는 공간. A place to share the joy of language, fleeting landscapes and delightful flavors, and the insights gained through learning.

250x250

Buy me a coffee

일	월	화	수	목	금	토
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

🐶짱구와 꾜미 집에 놀러온 용뇽이🦊