Extremely slow repr() on small data frame #1746

@njsmith

I have a small data frame here (obtained by slicing a much larger data frame with a MultiIndex):

In [6]: x.shape
Out[6]: (4, 32)

Its repr() looks like this:

In [7]: print repr(x)
<class 'pandas.core.frame.DataFrame'>
MultiIndex: 4 entries, ('arquant9.crw', 10, 336640) to ('arquant9.crw', 10, 336652)
Data columns:
lle     4  non-null values
lhz     4  non-null values
MiPf    4  non-null values
LLPf    4  non-null values
RLPf    4  non-null values
LMPf    4  non-null values
RMPf    4  non-null values
LDFr    4  non-null values
RDFr    4  non-null values
LLFr    4  non-null values
RLFr    4  non-null values
LMFr    4  non-null values
RMFr    4  non-null values
LMCe    4  non-null values
RMCe    4  non-null values
MiCe    4  non-null values
MiPa    4  non-null values
LDCe    4  non-null values
RDCe    4  non-null values
LDPa    4  non-null values
RDPa    4  non-null values
LMOc    4  non-null values
RMOc    4  non-null values
LLTe    4  non-null values
RLTe    4  non-null values
LLOc    4  non-null values
RLOc    4  non-null values
MiOc    4  non-null values
A2      4  non-null values
HEOG    4  non-null values
rle     4  non-null values
rhz     4  non-null values
dtypes: float64(32)

Producing this repr takes a bizarrely long time for a 4x32 frame, and the delay is very noticeable in interactive use:

In [8]: time repr(x)
CPU times: user 1.39 s, sys: 0.06 s, total: 1.45 s
Wall time: 1.36 s
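
For anyone trying to narrow this down, here is a minimal sketch of how the time could be measured and attributed using only the standard library. It is written for Python 3 (the session above is Python 2), and the helper names `time_repr` and `profile_repr` are hypothetical, not part of pandas. It makes no claim about where the time actually goes; it only shows one way to find out.

```python
import cProfile
import io
import pstats
import time


def time_repr(obj, repeats=3):
    """Return the best wall-clock time in seconds of repr(obj) over `repeats` runs."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        repr(obj)
        best = min(best, time.perf_counter() - start)
    return best


def profile_repr(obj, top=10):
    """Profile one repr(obj) call and return the `top` entries by cumulative time as text."""
    profiler = cProfile.Profile()
    profiler.enable()
    try:
        repr(obj)
    finally:
        profiler.disable()
    buf = io.StringIO()
    stats = pstats.Stats(profiler, stream=buf)
    stats.sort_stats("cumulative").print_stats(top)
    return buf.getvalue()
```

With the frame from this report loaded as `x`, calling `print(profile_repr(x))` should list the functions that dominate the cumulative time, which would help tell whether the cost sits in the column summary, the MultiIndex formatting, or somewhere else.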

The object is available here: http://vorpus.org/~njs/tmp/slow-repr-x.pickle


Labels: Performance (Memory or execution speed performance)
