Change Log

All noteable changes to this project are documented on this page. This project adheres to Semantic Versioning.

0.2.1

2022-05-23

  • Update requirement to beautifulsoup4 instead of bs4

0.2.0

2022-05-20

  • New minimum python version: Python 3.7.1

  • Added support for IPUMS CPS extracts with CpsExtract

  • Added CollectionInformation class to access collection-level information about IPUMS data.

  • Added ability to download Stata, SPSS, SAS, and R command files with data files download_extract().

  • Added extract_to_dict() and extract_from_dict() method to enable easy exporting of extract objects to dictionary objects and creation of extract objects from dictionaries.

  • Added define_extract_from_ddi() method to re-create an IPUMS extract object from a DDI codebook.

  • Added convenience method save_extract_as_json() to save IPUMS extract definition to json file.

  • Added convenience method define_extract_from_json() to read an IPUMS extract definition from a json file.

  • Added IpumsExtractNotSubmitted() exception. This will be raised when attempting to retrieve an extract id or download link from a extract that has not been submitted to the IPUMS extract engine.

  • Added get_all_types() method to access all types of ddi codebook variables in an easy way.

  • Added parameter string_pyarrow to get_all_types() method. If this parameter is set to True and used in conjunction with parameter type_format=”pandas_type” or type_format`=”pandas_type_efficient”, then the string column dtype (pandas.StringDtype()) is overriden with pandas.StringDtype(storage=”pyarrow”). Useful for users who want to convert an IPUMS extract in csv format to parquet format. The dictionary returned by this method can then be used in the dtype argument of read_microdata() or read_microdata_chunked().

  • Added pandas_type_efficient(). This type format is more efficient than pandas_type and is a sort of mix between pandas_type and numpy_type. Integer and float variables are coded as numpy.float64, string as pandas.StringDtype().

0.1.0

2021-11-30

  • This is the initial version of ipumspy.