[Python] 시각화에 주로 쓰이는 라이브러리 3가지

Notice

Recent Posts

Recent Comments

Link

« 2025/06 »
일	월	화	수	목	금	토
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

Archives

Today

Total

관리 메뉴

잡동사니 블로그

[Python] 시각화에 주로 쓰이는 라이브러리 3가지 본문

Python

[Python] 시각화에 주로 쓰이는 라이브러리 3가지

코딩부대찌개 2023. 8. 26. 18:55

사용데이터셋

https://www.kaggle.com/competitions/m5-forecasting-accuracy/overview

M5 Forecasting - Accuracy | Kaggle

www.kaggle.com

import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
import numpy as np
import plotly.express as px
url = 'm5-forecasting-accuracy'
calendar = pd.read_csv(f'{url}/calendar.csv')
selling_prices = pd.read_csv(f'{url}/sell_prices.csv')
sample_submission = pd.read_csv(f'{url}/sample_submission.csv')
sales_train_val = pd.read_csv(f'{url}/sales_train_validation.csv')
ids = sorted(list(set(sales_train_val['id'])))
d_cols = [c for c in sales_train_val.columns if 'd_' in c]
x_1 = sales_train_val.loc[sales_train_val['id'] == ids[2]].set_index('id')[d_cols].values[0]

데이터셋 다운로드 후, 불러오기

plt.style.use('ggplot')
plt.plot(np.arange(len(x_1)),x_1)
plt.show()

ggplot 스타일을 적용한 Matplotlib. 기본적인 라이브러리로 다양한 차트 생성 및 커스텀마이징이 가능함.

sns.set(style="whitegrid")
sns.lineplot(x=np.arange(len(x_1)),y=x_1)
plt.show()

Seaborn은 matplotlib기반으로 내장된 옵션으로 인해 비교적 쉽게 이쁜 스타일 만들어 내지만 커스텀마이징이 한정적임.

x=data,y=index와 같은 x= , y= 지정이 필요함.

fig=px.line(x=np.arange(len(x_1)),y=x_1)
fig.show()

Plotly는 위에 두 라이브러리랑 다르게 이미지 차트가 아닌 동적인 차트가 생성되며, 그래프 확대와 축소, 개인적으로 EDA하기 좋은 라이브러리라고 생각함. 단점은 가끔 느리다 정도?

'Python' 카테고리의 다른 글

[Python] pytorch와 sklearn의 train_test_split 활용하여 데이터 셋 나누기와 간단한 CNN (0)	2023.09.07
[Python] Selenium과 bs4를 이용한 크롤링 + Pyautogui (0)	2023.08.28
T-SNE 차원 축소 시각화 (0)	2022.09.08
[Kaggle] 신용카드 사기 분류(Credit Card Fraud Detection) (1)	2022.09.02
SMOTE를 활용한 Over sampling(오버샘플링) (5)	2022.09.01

'Python' Related Articles

잡동사니 블로그

[Python] 시각화에 주로 쓰이는 라이브러리 3가지 본문

[Python] 시각화에 주로 쓰이는 라이브러리 3가지

사용데이터셋

'Python' 카테고리의 다른 글

티스토리툴바