将Python中的特殊字符替换为“N/A”

我想将仅包含表情符号的所有行更改为df['Comments'][2]N/A。df['Comments'][:6]0 nice1 Insane32 😻😻❤️3 @bertelsen19864 20 or 30 mm rise on the Renthal Fatbar?5 Luckily I have one to 🔥💪🏻以下代码不会返回我期望的输出：df['Comments'].replace(';', ':', '!', '*', np.NaN)预期输出：df['Comments'][:6]0 nice1 Insane32 nan3 @bertelsen19864 20 or 30 mm rise on the Renthal Fatbar?5 Luckily I have one to 🔥💪🏻

查看完整描述

2 回答

人到中年有点甜

TA贡献1895条经验获得超7个赞

函数（remove_emoji）

尝试
安装第一个emoji库 -pip install emoji

import re

import emoji

df.Comments.apply(lambda x: x if (re.sub(r'(:[!_\-\w]+:)', '', emoji.demojize(x)) != "") else np.nan)

0 nice

1 Insane3

2 NaN

3 @bertelsen1986

4 Luckily I have one to 🔥💪🏻

Name: a, dtype: object

反对回复 2023-07-05

手掌心

TA贡献1942条经验获得超3个赞

您可以通过迭代每行中的 unicode 字符来检测仅包含表情符号的行（使用emoji和unicodedata包）：

df = {}

df['Comments'] = ["Test", "Hello 😉", "😉😉😉"]

import unicodedata

import numpy as np

from emoji import UNICODE_EMOJI

for i in range(len(df['Comments'])):

pure_emoji = True

for unicode_char in unicodedata.normalize('NFC', df['Comments'][i]):

if unicode_char not in UNICODE_EMOJI:

pure_emoji = False

break

if pure_emoji:

df['Comments'][i] = np.NaN

print(df['Comments'])

反对回复 2023-07-05

热搜

最近搜索清空

将Python中的特殊字符替换为“N/A”

将Python中的特殊字符替换为“N/A”

2 回答

添加回答