Python Pandas - Indicate duplicate index values except for the first occurrence

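To indicate duplicate index values except for the first occurrence, use the index.duplicated() method with the keep parameter set to 'first'. The method returns a boolean array in which True marks a value that has already appeared earlier in the index.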

First, import the required library -

import pandas as pd

Create an index that contains some duplicates -

index = pd.Index(['Car','Bike','Airplane','Ship','Airplane'])

Display the index -

print("Pandas Index with duplicates...\n",index)

Indicate duplicate index values as True, except for the first occurrence, by setting the "keep" parameter to "first" -

print("\nIndicating duplicate values except the first occurrence...\n", index.duplicated(keep='first'))
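The effect of the "keep" parameter is easier to see next to its other accepted values. The following is a minimal sketch on the same index; the variable names first_mask, last_mask and all_mask are only illustrative and do not come from the original example.

```python
import pandas as pd

index = pd.Index(['Car', 'Bike', 'Airplane', 'Ship', 'Airplane'])

# keep='first' (the default): flag every repeat after the first occurrence
first_mask = index.duplicated(keep='first')

# keep='last': flag every repeat before the last occurrence
last_mask = index.duplicated(keep='last')

# keep=False: flag every occurrence of a value that appears more than once
all_mask = index.duplicated(keep=False)

print(first_mask.tolist())  # [False, False, False, False, True]
print(last_mask.tolist())   # [False, False, True, False, False]
print(all_mask.tolist())    # [False, False, True, False, True]
```

Since 'first' is the default, calling index.duplicated() with no arguments gives the same result as keep='first'.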

Example

Following is the complete code -

import pandas as pd

# Creating the index with some duplicates
index = pd.Index(['Car','Bike','Airplane','Ship','Airplane'])

# Display the index
print("Pandas Index with duplicates...\n",index)

# Return the dtype of the data
print("\nThe dtype object...\n",index.dtype)

# Get the number of dimensions of the data
print("\nGet the dimensions...\n",index.ndim)

# Indicate duplicate index values as True, except the first occurrence
# Set the "keep" parameter to "first"
print("\nIndicating duplicate values except the first occurrence...\n", index.duplicated(keep='first'))

Output

This will produce the following output -

Pandas Index with duplicates...
Index(['Car', 'Bike', 'Airplane', 'Ship', 'Airplane'], dtype='object')

The dtype object...
object

Get the dimensions...
1

Indicating duplicate values except the first occurrence...
[False False False False  True]
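The boolean array can also be used as a mask on the index itself. The sketch below is one possible follow-up, assuming the goal is to list the repeated labels or to keep only the first occurrence of each label; the names mask, repeats and unique_index are illustrative.

```python
import pandas as pd

index = pd.Index(['Car', 'Bike', 'Airplane', 'Ship', 'Airplane'])

# True for every label that has already appeared earlier in the index
mask = index.duplicated(keep='first')

# Select only the labels flagged as repeats
repeats = index[mask]

# Invert the mask to keep the first occurrence of each label
unique_index = index[~mask]

print(repeats.tolist())       # ['Airplane']
print(unique_index.tolist())  # ['Car', 'Bike', 'Airplane', 'Ship']
```

For this case, index.drop_duplicates(keep='first') returns the same labels as index[~mask]; the explicit mask is mainly useful when the same filter also has to be applied to other data aligned with the index.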
