Skip to content

Notebooks

Kerby Shedden edited this page Feb 8, 2015 · 28 revisions

Data management and analysis in Python

This is a collection of IPython notebooks demonstrating a variety of techniques for data management and analysis.

Language and library basics

This section contains notebooks that are not tied to specific data sets.

US 2012 Medicare payment data

The data can be obtained here. The first script below can be used to produce state-level data files that are used in the subsequent scripts.

Comparison of gene expression in ulcerative colitis and Chron's disease patients

The data are available from the NCBI Gene Expression Omnibus (GEO) here.

Analysis of nation-level fertility trends

The data are available from the World Bank here.

National Health and Nutrition Examination Survey (NHANES)

The data are available from the NHANES web site: http://www.cdc.gov/nchs/nhanes.htm

Births and deaths of notable individuals

The raw data are available from the supplementary materials of this article.

Global Surface Network weather data

The raw data are available here.

  • Script to generate plots of annual series of climate variables