Skip to content

Files

Latest commit

7f432d0 · Nov 9, 2014

History

History
17 lines (11 loc) · 709 Bytes

README.md

File metadata and controls

17 lines (11 loc) · 709 Bytes

Scraper of Licensed Restaurants in Hong Kong

Developed by Sammy Fung [email protected]

Food and Environmental Hygiene Department in Hong Kong Governement released a PSI dataset (in XML format) contains information of licensed restaurants in Hong Kong. This scraper is written in python, with use of scrapy web scraping framework.

Data.One Dataset: http://www.gov.hk/en/theme/psi/datasets/restaurantlicences.htm

Installation

$ pip install -r requirements.txt
$ git clone https://github.com/sammyfung/LP_Restaurants_Scrapy
$ cd LP_Restaurants_Scrapy/LP_Restaurants_Scrapy
$ scrapy crawl rlxml -t json -o -o restaurant_licenses.json