首页猿问如何有条件地拆分熊猫的列

如何有条件地拆分熊猫的列

Python

湖上湖 2021-09-11 17:58:58

我有一个日志 df，在那个 df 我有列描述。好像。DescriptionMachine x : Turn offAnother action hereAnother action hereMachine y : Turn offMachine x : Turn onAnother action here我只需要用“：”分割行喜欢：Description Machine ActionMachine x : Turn off Machine x Turn offAnother action hereAnother action hereMachine y : Turn off Machine y Turn offMachine x : Turn on Machine x Turn onAnother action here我已经尝试过：s = df["Description"].apply(lambda x:x.split(":"))df["Action"] = s.apply(lambda x: x[1])df["Machine"] = s.apply(lambda x: x[0])还有一些带有“startswith”的东西。

查看完整描述

3 回答

qq_花开花谢_0

TA贡献1835条经验获得超7个赞

您可以使用str.extract合适的regex. 这将找到周围的所有值:（也去除冒号周围的空格）：

df[['Machine', 'Action']] = df.Description.str.extract('(.*) : (.*)',expand=True)

>>> df

Description Machine Action

0 Machine x : Turn off Machine x Turn off

1 Another action here NaN NaN

2 Another action here NaN NaN

3 Machine y : Turn off Machine y Turn off

4 Machine x : Turn on Machine x Turn on

5 Another action here NaN NaN

# df[['Machine', 'Action']] = df.Description.str.extract('(.*) : (.*)',expand=True).fillna('')

反对回复 2021-09-11

婷婷同学_

TA贡献1844条经验获得超8个赞

只需使用split与expand=True

df[['Machine', 'Action']] =df.Description.str.split(':',expand=True).dropna()

Description Machine Action

0 Machine x : Turn off Machine x Turn off

1 Another action here NaN NaN

2 Another action here NaN NaN

3 Machine y : Turn off Machine y Turn off

4 Machine x : Turn on Machine x Turn on

5 Another action here NaN NaN

反对回复 2021-09-11

陪伴而非守候

TA贡献1757条经验获得超8个赞

给定一个数据框

>>> df

Description

0 Machine x : Turn off

1 Another action here

2 Another action here

3 Machine y : Turn off

4 Machine x : Turn on

5 Another action here

我会通过Series.str.split(splitter, expand=True).

>>> has_colon = df['Description'].str.contains(':')

>>> df[['Machine', 'Action']] = df.loc[has_colon, 'Description'].str.split('\s*:\s*', expand=True)

>>> df

Description Machine Action

0 Machine x : Turn off Machine x Turn off

1 Another action here NaN NaN

2 Another action here NaN NaN

3 Machine y : Turn off Machine y Turn off

4 Machine x : Turn on Machine x Turn on

5 Another action here NaN NaN

如果您更喜欢空字符串，可以NaN通过以下方式替换单元格

>>> df.fillna('')

Description Machine Action

0 Machine x : Turn off Machine x Turn off

1 Another action here

2 Another action here

3 Machine y : Turn off Machine y Turn off

4 Machine x : Turn on Machine x Turn on

5 Another action here

反对回复 2021-09-11

3 回答
0 关注
161 浏览

关注

添加回答

0/150

提交

取消

热搜

最近搜索清空

如何有条件地拆分熊猫的列

如何有条件地拆分熊猫的列

3 回答

添加回答