Python数据分析系列（五）：python数据结构

Python数据分析系列（五）：python数据结构 — Pandas中的Series使用

server/2024/9/23 11:12:29/

文章目录

前言
一、Series创建与属性
二、Series的索引
三、Series的基本运算
四、Series的数据对齐
五、Series操作
- 1、判断是否是唯一值
- 2、判断值
- 3、值计数
- 4、缺失值处理
- - 1、滤除缺失数据
  - 2、填充缺失数据
- 5、日期时间列中提取月份和年份

前言

Pandas 是基于 NumPy 的一种工具，该工具是为了解决数据分析任务而创建的。其中Series和DataFrame是两种最主要的数据结构，本文主要介绍Series的使用。

一、Series创建与属性

基本特征：
- 类似一维数组的对象
- 由数据和索引组成
属性：
- 索引(index)：对应是最左侧那一列。
- 数据(values)：每一个索引的右侧对应一个值。
- name：Series对象及其索引(index)都有一个name属性。

示例1：

python">import pandas as pd
aSeries=pd.Series([1,2,'a'])
aSeries
# 输出：
# 0    1
# 1    2
# 2    a
# dtype: object

Series字符串表现形式为：索引在左边，值在右边。

示例2：自定义Series的index。

python">import pandas as pd
aSeries=pd.Series(['apple','orange','lemon'],index=[1,2,3])
aSeries
# 输出：
# 1     apple
# 2    orange
# 3     lemon
# dtype: objectaSeries.index
# 输出：
# Int64Index([1, 2, 3], dtype='int64')aSeries.index=[4,5,6] #Series索引可以通过赋值的方式就地修改
aSeries
# 输出：
# 4     apple
# 5    orange
# 6     lemon
# dtype: objectaSeries.values
# 输出：
# array(['apple', 'orange', 'lemon'], dtype=object)

示例3：如果数据被存放在一个python字典中，也可以直接通过这个字典来创建Series。

python">import numpy as np
data={'apple':'8.4','orange':'7','lemon':'4'} 
aSeries=pd.Series(data)
aSeries
# 输出：
# apple     8.4
# orange      7
# lemon       4
# dtype: object

示例4：Series及其索引(index)的name属性

python">import pandas as pd
aSeries=pd.Series(['apple','orange','lemon'],index=[1,2,3])
aSeries.name="price"
aSeries.index.name="id"
aSeries
# 输出：
# id
# 1     apple
# 2    orange
# 3     lemon
# Name: price, dtype: object

二、Series的索引

示例1：索引单个值

python">import pandas as pd
aSeries=pd.Series(['apple','orange','lemon'],index=['a','b','c'])
aSeries['a']
# 输出：
# 'apple'aSeries['c']='peach' #Series索引对应的数据可以通过赋值的方式就地修改
aSeries
# 输出：
# a     apple
# b    orange
# c     peach
# dtype: object

示例2：索引一组值

python">import pandas as pd
aSeries=pd.Series(['apple','orange','lemon'],index=['a','b','c'])
aSeries[['c','a']]
# 输出：
# c    peach
# a    apple
# dtype: object

示例3：层次化索引

python">import pandas as pd
aSeries= pd.Series(np.random.randn(10),index

Python数据分析系列（五）：python数据结构 — Pandas中的Series使用

文章目录

前言

一、Series创建与属性

二、Series的索引

相关文章

商城数据库88章表80~83

IOPS ；MB/S分别在衡量RAN/SEQ R/W的原因说明

费曼学习法个人总结-1

新科技辅助器具赋能视障生活：让盲人出行融入日常

火车头采集怎么发布到Wordpress

XTuner微调LLM：1.8B、多模态和Agent-笔记四

每日一题：Redis 中的内存淘汰机制、有哪些内存淘汰策略❓

共享旅游卡项目如何做线上运营？分享运营的3个核心点！