(Python : Basic) pandas

Posted by : Python at Sep 1, 2020

Category : Python

numpy? pandas? 무슨 차이지?

numpy : 배열의 index가 숫자임
pandas : 배열의 index를 지정가능
- 그럼 dictionary아닌가? -> 역시 연산속도 때문에 pandas사용

import pandas as pd

a = pd.Series([1, 2, 3])
# 이렇게 하면 numpy와 동일하게 숫자로 indexing

a = pd.Series([1, 2, 3], index=['a', 'b', 'c'])
# 이런식의 선언이 가능

a = pd.Series({'a': 1, 'b': 2, 'c': 3})
# 이것도 가능

tips

a.head(2)
# 상위 2개만 출력

a = pd.Series([1, 2, 3, np.nan])

len(a) 
# 4

a.count() 
# 3 : nan을 제외한 숫자 리턴

a.unique()    
# unique한 값의 갯수 리턴

s3 = pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd']) 
s4 = pd.Series([4, 3, 2, 1], index=['d', 'c', 'b', 'a'])

print(s3 + s4)

a    2
b    4
c    6
d    8
dtype: int64