Hapmap download

This step is not required to prepare your data for a candidate gene study. You need it only if you want to download the source text data files from hapmap.org.

Downloading hapmap data

You can download the hapmap.org data with the program “ge-fetch-hapmap”, a script written in Python that runs on GNU/Linux and Microsoft Windows.

$ ./ge-fetch-hapmap --help
Usage: ge-fetch-hapmap [options]

Options:
  -h, --help            show this help message and exit
  -c FILE, --config=FILE
                        Configuration file
  -p PROXY, --proxy=PROXY
                        HTTP Proxy
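
The help text above resembles the output of Python's standard optparse module. As a minimal sketch of how such an option interface could be declared (an assumption, not the script's actual source), the following reproduces the two documented options. The function name build_parser and the default value hapmap.ini for --config are hypothetical.

```python
from optparse import OptionParser


def build_parser():
    """Build a parser with the two options shown in the help output."""
    parser = OptionParser(prog="ge-fetch-hapmap")
    parser.add_option(
        "-c", "--config",
        dest="config",
        metavar="FILE",
        default="hapmap.ini",  # hypothetical default, not confirmed by the source
        help="Configuration file",
    )
    parser.add_option(
        "-p", "--proxy",
        dest="proxy",
        metavar="PROXY",
        help="HTTP Proxy",
    )
    return parser


# Parse an explicit argument list instead of sys.argv, for illustration only.
options, _ = build_parser().parse_args(["--config=hapmap.ini"])
print(options.config, options.proxy)
```

Passing an explicit list to parse_args keeps the example independent of the real command line. An actual script would normally call parse_args() with no arguments.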

The program reads its settings from a configuration file, hapmap.ini. An example:

; Configuration for hapmap.org data download.
;
; Author: Maciej Blizinski <maciej.blizinski@gmail.com>

[hapmap]
; The base URL
Base URL: http://www.hapmap.org/downloads/phasing/2006-07_phaseII/phased/

; Chromosomes to download, for example: 1-22, X
; Chromosomes: 1-3, 5, X
Chromosomes: 1-22, X

; Populations, in the form FILE_NAME (ANALYSIS_NAME), where FILE_NAME
; is the part of the hapmap filename to download and ANALYSIS_NAME
; is the name used in the analysis.
Populations: CEU (Eur), JPT+CHB (Asian), YRI (Afr)

; Directory for hapmap data
Data directory: hapmap-data

; Directory for original hapmap files
Download directory: download
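
To illustrate how the settings above could be interpreted, here is a minimal sketch that reads the same keys with Python's standard configparser module and expands the chromosome and population lists. It is an assumption about the format's meaning, not the script's actual implementation. The helper names expand_chromosomes, parse_populations and load_settings, and the keys of the returned dictionary, are hypothetical.

```python
import configparser
import re

# A shortened copy of the configuration shown above.
SAMPLE = """\
; Configuration for hapmap.org data download.
[hapmap]
Base URL: http://www.hapmap.org/downloads/phasing/2006-07_phaseII/phased/
Chromosomes: 1-22, X
Populations: CEU (Eur), JPT+CHB (Asian), YRI (Afr)
Data directory: hapmap-data
Download directory: download
"""


def expand_chromosomes(spec):
    """Expand a list such as '1-3, 5, X' into ['1', '2', '3', '5', 'X']."""
    result = []
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        match = re.fullmatch(r"(\d+)\s*-\s*(\d+)", part)
        if match:
            first, last = int(match.group(1)), int(match.group(2))
            result.extend(str(n) for n in range(first, last + 1))
        else:
            # Single entries such as '5' or 'X' are kept as they are.
            result.append(part)
    return result


def parse_populations(spec):
    """Split 'CEU (Eur), YRI (Afr)' into (file name, analysis name) pairs."""
    pattern = r"([^\s,()]+)\s*\(([^()]*)\)"
    return [(m.group(1), m.group(2).strip()) for m in re.finditer(pattern, spec)]


def load_settings(text):
    """Read the [hapmap] section from configuration text."""
    # Interpolation is disabled so that '%' in a URL is never treated specially.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(text)
    section = parser["hapmap"]  # option names are matched case-insensitively
    return {
        "base_url": section["Base URL"],
        "chromosomes": expand_chromosomes(section["Chromosomes"]),
        "populations": parse_populations(section["Populations"]),
        "data_dir": section["Data directory"],
        "download_dir": section["Download directory"],
    }


settings = load_settings(SAMPLE)
print(settings["chromosomes"])
print(settings["populations"])
```

configparser accepts both the colon separator and the semicolon comments used in hapmap.ini by default, so the file needs no conversion before it is read this way.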

When the script finishes, the download directory contains both compressed and uncompressed data files. If you download all chromosomes, the directory occupies about 800 MB of disk space.

Last modified on Tue Feb 26 07:53:16 +0000 2008