forked from pandas-dev/pandas
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathv0.5.0.txt
60 lines (53 loc) · 4.04 KB
/
v0.5.0.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
.. _whatsnew_050:
v.0.5.0 (October 24, 2011)
--------------------------
New Features
~~~~~~~~~~~~
- :ref:`Added <basics.df_join>` ``DataFrame.align`` method with standard join options
- :ref:`Added <io.parse_dates>` ``parse_dates`` option to ``read_csv`` and ``read_table`` methods to optionally try to parse dates in the index columns
- :ref:`Added <io.parse_dates>` ``nrows``, ``chunksize``, and ``iterator`` arguments to ``read_csv`` and ``read_table``. The last two return a new ``TextParser`` class capable of lazily iterating through chunks of a flat file (GH242_)
- :ref:`Added <merging.multikey_join>` ability to join on multiple columns in ``DataFrame.join`` (GH214_)
- Added private ``_get_duplicates`` function to ``Index`` for identifying duplicate values more easily (ENH5c_)
- :ref:`Added <indexing.df_cols>` column attribute access to DataFrame.
- :ref:`Added <indexing.df_cols>` Python tab completion hook for DataFrame columns. (PR233_, GH230_)
- :ref:`Implemented <basics.describe>` ``Series.describe`` for Series containing objects (PR241_)
- :ref:`Added <merging.df_inner_join>` inner join option to ``DataFrame.join`` when joining on key(s) (GH248_)
- :ref:`Implemented <indexing.df_cols>` selecting DataFrame columns by passing a list to ``__getitem__`` (GH253_)
- :ref:`Implemented <indexing.set_ops>` & and | to intersect / union Index objects, respectively (GH261_)
- :ref:`Added<reshaping.pivot>` ``pivot_table`` convenience function to pandas namespace (GH234_)
- :ref:`Implemented <basics.rename_axis>` ``Panel.rename_axis`` function (GH243_)
- DataFrame will show index level names in console output (PR334_)
- :ref:`Implemented <indexing.take>` ``Panel.take``
- :ref:`Added<basics.console_output>` ``set_eng_float_format`` for alternate DataFrame floating point string formatting (ENH61_)
- :ref:`Added <indexing.set_index>` convenience ``set_index`` function for creating a DataFrame index from its existing columns
- :ref:`Implemented <groupby.multiindex>` ``groupby`` hierarchical index level name (GH223_)
- :ref:`Added <io.store_in_csv>` support for different delimiters in ``DataFrame.to_csv`` (PR244_)
- TODO: DOCS ABOUT TAKE METHODS
Performance Enhancements
~~~~~~~~~~~~~~~~~~~~~~~~
- VBENCH Major performance improvements in file parsing functions ``read_csv`` and ``read_table``
- VBENCH Added Cython function for converting tuples to ndarray very fast. Speeds up many MultiIndex-related operations
- VBENCH Refactored merging / joining code into a tidy class and disabled unnecessary computations in the float/object case, thus getting about 10% better performance (GH211_)
- VBENCH Improved speed of ``DataFrame.xs`` on mixed-type DataFrame objects by about 5x, regression from 0.3.0 (GH215_)
- VBENCH With new ``DataFrame.align`` method, speeding up binary operations between differently-indexed DataFrame objects by 10-25%.
- VBENCH Significantly sped up conversion of nested dict into DataFrame (GH212_)
- VBENCH Significantly speed up DataFrame ``__repr__`` and ``count`` on large mixed-type DataFrame objects
.. _GH214: https://github.com/pydata/pandas/issues/214
.. _GH248: https://github.com/pydata/pandas/issues/248
.. _GH253: https://github.com/pydata/pandas/issues/253
.. _GH261: https://github.com/pydata/pandas/issues/261
.. _GH234: https://github.com/pydata/pandas/issues/234
.. _GH243: https://github.com/pydata/pandas/issues/243
.. _GH223: https://github.com/pydata/pandas/issues/223
.. _PR244: https://github.com/pydata/pandas/pull/244
.. _PR233: https://github.com/pydata/pandas/pull/233
.. _GH230: https://github.com/pydata/pandas/issues/230
.. _PR241: https://github.com/pydata/pandas/pull/241
.. _GH242: https://github.com/pydata/pandas/issues/242
.. _GH212: https://github.com/pydata/pandas/issues/212
.. _GH211: https://github.com/pydata/pandas/issues/211
.. _GH215: https://github.com/pydata/pandas/issues/215
.. _GH213: https://github.com/pydata/pandas/issues/213
.. _ENH61: https://github.com/pydata/pandas/commit/6141961
.. _PR334: https://github.com/pydata/pandas/pull/334
.. _ENH5c: https://github.com/pydata/pandas/commit/5ca6ff5d822ee4ddef1ec0d87b6d83d8b4bbd3eb