Tuesday, August 2, 2011

Processing the Groupon API with Altova MapForce

We often think of a data integration project as a translation from one singular data input file to some other data set, but Altova MapForce lets you greatly expand the concept of an input file.

For instance, the MergeMultipleFiles.mfd example installed with MapForce illustrates how you can use a filename with wildcard characters to merge multiple input files into a single output.

MapForce MergeMultipleFiles.mfd example

A MapForce mapping input doesn’t even need to be a physical file – it can be a URL that returns predictable structured data, like the APIs for popular Web sites like Groupon and many others.

In this blog post we’ll describe how to use Altova tools to retrieve, filter, analyze, and present data available from a Web-based API, using Groupon as an example. If you want to follow along yourself, you will first need to visit http://www.groupon.com/pages/api to request your personal Groupon API client key.

The Problem: All Deals Are Local

The Groupon Web site and email subscriptions are great for finding deals in your local neighborhood, but what if you’re looking for a deal to use on an upcoming vacation, or for a gift for friends or family across the country? Sure, you could enter each location manually at the Groupon Web page, but that’s so last century. Let’s use the Altova MissionKit to automate things.

The Groupon API offers two URL queries that return data in .json or .xml formats: the first returns a list of all Groupon localities (called divisions), and the second returns current deals information for one named division.

If we want to see all the deals for more than one division, we need to resolve multiple URLs and aggregate the data into a single result. Yes, MapForce can do that!

First We Need a Schema

The Groupon API documentation describes the elements that will be returned by our requests, but doesn’t provide an XML Schema. That’s okay, we can use MapForce to generate one.

All we have to do is open a new mapping design and choose Insert XML Schema/File, then click the Switch to URL button. Now we can enter the URL to retrieve the Groupon divisions list:

Inserting a new component into a MapForce mapping by URL

When we click the Open button MapForce offers to generate the schema:

MapForce offers to generate an XML Schema

When we click Yes, the File / Save dialog opens. I saved the schema as divisions.xsd, and the mapping with the new XML Schema inserted looks like this:

Generated .xsd as a new component in a MapForce design

And the Properties dialog for the XML Schema component automatically contains the API /divisions URL as the Input XML File:

Component properties for the generated .xsd

Check the Work

We want to filter the Groupon divisions data to build a list of id names to use for deal queries for each locality. But before we go any further, now might be a good time to apply the text file trick from the Quick Solution for Complicated Functions blog post to look at the id values. When we insert the text file and connect the divisions and id schema elements, the mapping looks like this:

MapForce design with text file to preview output

We connected the division element to Rows in the text file in order to generate a new row in the text file for each unique division, so that Field1 in each row will hold the id. Clicking the Output button now generates this result:

MapForce Output window

All we need to do is apply the concat string function to build the list of /deal URLs for all division IDs. The next step in the mapping looks like this:

Using the MapForce concat function to build a string

Rolling the cursor over the constant connected to value1 of the concat function displays its full definition:

Definition of a MapForce string constant

When we click the Output button to execute the mapping, the Output file now looks like this:

MapForce Ouput window

As a further review, we can open the generated XML Schema in XMLSpy and display it in graphical Schema View:

Altova XMLSpy graphical schema view of the generated .xsd

So far we have:

  • built a MapForce mapping that queries the Groupon API for all divisions
  • extracted the division id fields
  • and built a list of URLs for API queries to get the deals in each division

In the next post in this series we will process the list of deal queries as the input for a new mapping component and filter the output for some interesting information.

Find out for yourself how easy it is to apply MapForce to convert data from a Web API! Download a free 30-day trial of MapForce.

Editor’s Note: Our original series on mapping data from the Groupon API ran in three parts you can see by clicking the links here:

Part 1 of Processing the Groupon API with Altova MapForce describes how to create dynamic input by collecting data from multiple URLs.

Processing the Groupon API with MapForce – Part 2 describes how we filtered data from the API and defined the output to extract only the most interesting details.

Processing the Groupon API – Part 3 describes formatting the output as a single HTML document optimized for desktop and mobile devices, and reviews ways to automate repeat execution.

2 comments:

DaveMcG said...

Part 2 of this series is available at:
http://blog.altova.com/2011/08/processing-groupon-api-with-mapforce.html

-- davemcg

DaveMcG said...

In Part 3 we take the response from the API and format it in html for a computer or mobile device:

http://blog.altova.com/2011/08/processing-groupon-api-part-3.html