Inspired by Derek Miller’s “Average Broadway”, I tried to look at the correlation between a musical/play’s setting(s) and its prestige and box office. I web scrapped IBDB.com, BroadwayLeague.com, TonyAwards.com, and Wikipedia, and then performed entity recognition and geoencoding (all in Python) to construct a PostgreSQL database which can be queried via PHP PDO. The project website is an interactive Leaflet.js map.
Started as final project for INSTG033 Server Programming and Structured Data with Dr. Oliver Duke-Williams and Dr. Antonis Bikakis at UCL.