| http://www.w3.org/ns/prov#value | - tdout USING PigStreaming(','))> ship ('$scriptDir/analyze_unif_recs.py');>> UnifLines = load '$unif_xml'> using org.apache.pig.piggybank.storage.XMLLoader('REC')> as (doc:chararray);> UnifXmlByDocId = stream UnifLines through analyze_unif> as (docid : int,> xml_comp: chararray> );>> where analyze_unif_recs.py is a python script I wrote that does the xml> parsing, and org.apache.pig.piggybank.stora
|