Skip to content

BUG: ValueError on groupby with categoricals #34951

Closed
@LukasGelbmann

Description

@LukasGelbmann
  • I have checked that this issue has not already been reported.

  • I have confirmed this bug exists on the latest version of pandas.

  • (optional) I have confirmed this bug exists on the master branch of pandas.


In specific situations involving categorical columns, a groupby() on two or more columns runs into an error:

Code Sample

import pandas as pd
col = pd.Categorical([0, 1])
df = pd.DataFrame({'A': col, 'B': col, 'C': col})
grouped = df.groupby(['A', 'B']).first()
# ValueError: Shape of passed values is (4, 1), indices imply (2, 1)

Expected Output

Expected output would be something like:

>>> grouped
       C
A B     
0 0    0
  1  NaN
1 0  NaN
  1    1

Instead an exception is thrown, with the error message shown above.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions