Playing around with data is not an easy task. There's no secret formula for finding your way through a bunch of PDF, CSV and Excel files, APIs or whatever else you might have to deal with. There isn't even a single tool that can help you through the whole process; on the contrary, you'll have to spend some time learning and trying out different techniques. The good news is that the available tools are pretty awesome, and they are getting even more awesome day by day.

In this series of articles we will take a look at a couple of really mind-blowing tools.

First we will take our Google contacts and export them as a CSV file. Then we will process them with OpenRefine to clean up all the messy data and to call a Google Maps web service to geolocate each contact. Next we will import the result into a CartoDB table, to expose it as a web service API. Finally, with some JavaScript magic, we will use Leaflet to browse our contacts on a map.

Usually, finding the right dataset and understanding how the information is organized and what we can do with it is a whole challenge by itself. Just to make things easy, and to let you play with your own info, we will use our list of Gmail contacts. Just go to your Gmail account, choose Contacts, and then click on More, Export. Choose the Outlook CSV format (don't ask me why, but choosing Google CSV just didn't work. You'd better get used to it: working with data involves a lot of trial and error).

Now we will have to process our dataset to find messy data and try to fix it. An ideal tool for this kind of stuff is OpenRefine (not so long ago known as Google Refine). OpenRefine is a Java application that comes bundled with its own Jetty-based HTTP server. Just download it, uncompress it and start the server:

00:02:22.831 Starting Server bound to '127.0.0.1:3333' (0ms)

As you can see from the command output, the server will be listening on port 3333 of localhost. So just open a web browser and go to that address. Click on Choose Files and then select the file where you exported your contacts from Google. OpenRefine will immediately show you its best attempt to parse the file. You have several options to set here. But if you pay attention to the data, you'll see you have an unfortunately far too common encoding problem.
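If you want to see what the encoding problem in an exported contacts file looks like outside OpenRefine, here is a minimal Python sketch. It is only an illustration, not part of the OpenRefine workflow: the candidate encodings, the helper names (`decode_with_first_match`, `read_contacts`) and the sample contact row are all assumptions made up for this example, and your real export may use a different encoding.

```python
import csv
import io

# Assumed candidates, tried in order. "latin-1" never fails, so it is only a
# last resort and may still produce the wrong characters.
CANDIDATE_ENCODINGS = ["utf-8-sig", "cp1252", "latin-1"]


def decode_with_first_match(raw, encodings=CANDIDATE_ENCODINGS):
    """Return (encoding, text) for the first candidate that decodes cleanly."""
    for name in encodings:
        try:
            return name, raw.decode(name)
        except UnicodeDecodeError:
            continue
    raise ValueError("none of the candidate encodings could decode the data")


def read_contacts(raw):
    """Decode the bytes and parse them as CSV rows keyed by header name."""
    encoding, text = decode_with_first_match(raw)
    rows = list(csv.DictReader(io.StringIO(text)))
    return encoding, rows


# Made-up sample standing in for an exported contacts file.
sample = "First Name,Last Name,City\nJosé,Núñez,Córdoba\n".encode("utf-8")

encoding, rows = read_contacts(sample)
print(encoding, rows[0]["First Name"], rows[0]["City"])

# Decoding the same bytes with the wrong encoding shows the typical garbling.
garbled = sample.decode("cp1252").splitlines()[1]
print(garbled)
```

For a real file you would read the bytes with `open(path, "rb").read()` and pass them to `read_contacts`. If accented names come out as pairs of odd characters, the file was most likely decoded with the wrong encoding, which is the same thing you can correct in the encoding option of the OpenRefine import preview.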