Assignment: Data and System Integration
Case Study: Industrial Internet for GE
After evaluating various platforms, GE finally decided to adopt a Service Oriented Architecture (SOA) for its future IT infrastructure. GE was impressed with your Template based, Reference Architecture approach set out in your Task 1 ‘Industrial Internet' Report and has accepted your recommendations regarding:
•	Computing and storage infrastructure design,
•	Application / service integration, and
•	Information integration.
However GE Executives would like to gain a deeper understanding of the Technologies behind your recommendation and have asked for a small specific demonstration of these technologies along with a brief explanation of the concepts and principles of how it works.
Because GE have medical clinic related applications in many clinics around Australia, GE Executives would like a demonstration of keeping track of demographic information for each medical clinic location. The demo enables GE employees to browser search the locations of medical clinics in Google Maps using a postcode and retrieve an example of current demographic information along with the map. There are three major components in the demo system:
1.	Data integration demo. In this demo, there are two data source files: "clinics.csv" contains the information about fictional clinics, and "locations.xml" contains the information about the location coordinates for each clinic. A Python scrip file with the name "data_merger.py" should be implemented to read these two data files. Using the Python petl framework you are required to:
a.	clean the data in the "clinics.csv". This involves using python and Regex to standardise email addresses so they are usable as a html link, and
b.	output the merged and cleaned data into a CSV file with the name "clinic_locations.csv".
2.	RESTful Web service server demo. In this demo, you are required to build a RESTful Web service which supports a ‘getlocation' query from the client browser similar to "/getlocation?postcode=xxxx". The server Web Service will return a JSON object with five attributes (clinicname, latitude, longitude, email and state). The data for this JSON object will be retrieved from the "clinic_locations.csv" file created in component 1. The framework Bottle is required for this implementation. A Python scrip file with the name "clinic_locator.py" will contain this RESTful Web Service Demo.
3.	Mashup demo. In this demo, you are required to build a mashup application which accepts a postcode as the input. If a medical clinic in that suburb can be found, then its location is displayed in Google Map. Otherwise, the default clinic (with postcode 2000) is displayed. A HTML file "clinic_map.html" should be implemented. In this file, a text field is provided at the page top to accept user's input of postcode. In addition, a Submit button will trigger the "getlocation" Web Service to retrieve its corresponding latitude and longitude, and highlight and centre the location in Google map.
In addition to highlighting the positon on the map, a user is able to click on the marker bubble and see an information window containing:
a.	suburb specific demographic information or a link to that information, and
b.	a link to the email address for that clinic. After clicking on the email link, the user's email application must pop up and automatically insert the subject line "Clinic Enquiry".
Important Note: For ease of demonstration to GE executives, your code must be self- contained. In addition to using Standards based HTML and Python, petl, Bottle and parser are the only additional frameworks you should need. If you use others they must be provided and not require installation.
Attachment:- Data-Files.rar