hadoop - Trying to load indexed LZO file using LzoPigStorage and elephant-bird
I've got a log file with default LZO compression and a .index file generated using hadoop-lzo, but when I run a simple Pig file to retrieve the top 100 records using LzoPigStorage, I get the following exception:
Message: Unexpected System Error Occured: java.lang.RuntimeException: java.lang.reflect.InvocationTargetException
    at org.apache.pig.backend.hadoop23.PigJobControl.submit(PigJobControl.java:130)
    at org.apache.pig.backend.hadoop23.PigJobControl.run(PigJobControl.java:191)
    at java.lang.Thread.run(Thread.java:724)
    at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher$1.run(MapReduceLauncher.java:257)
Caused by: java.lang.reflect.InvocationTargetException
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.pig.backend.hadoop23.PigJobControl.submit(PigJobControl.java:128)
    ... 3 more
Caused by: java.lang.IncompatibleClassChangeError: Found interface org.apache.hadoop.mapreduce.JobContext, but class was expected
    at com.twitter.elephantbird.mapreduce.input.LzoInputFormat.listStatus(LzoInputFormat.java:55)
    at org.apache.hadoop.mapreduce.lib.input.FileInputFormat.getSplits(FileInputFormat.java:269)
    at com.twitter.elephantbird.mapreduce.input.LzoInputFormat.getSplits(LzoInputFormat.java:111)
    at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigInputFormat.getSplits(PigInputFormat.java:274)
    at org.apache.hadoop.mapreduce.JobSubmitter.writeNewSplits(JobSubmitter.java:452)
    at org.apache.hadoop.mapreduce.JobSubmitter.writeSplits(JobSubmitter.java:469)
    at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:366)
    at org.apache.hadoop.mapreduce.Job$11.run(Job.java:1269)
    at org.apache.hadoop.mapreduce.Job$11.run(Job.java:1266)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1408)
    at org.apache.hadoop.mapreduce.Job.submit(Job.java:1266)
    at org.apache.hadoop.mapreduce.lib.jobcontrol.ControlledJob.submit(ControlledJob.java:336)
I am running Hadoop 2.0, Pig 0.11, and elephant-bird 2.2.3.
I don't use elephant-bird, so I'm not entirely sure what the problem is.
But looking at the build for v2.2.3, it is compiled against hadoop-0.20.2 and pig-0.9.2. I've seen problems with UDFs in Pig when running on a newer version than the one the UDF was compiled against.
Are you able to upgrade elephant-bird to a newer version, or recompile it against the correct libraries?
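For context, org.apache.hadoop.mapreduce.JobContext is a concrete class in Hadoop 0.20.x and an interface in Hadoop 2.x, which is why a jar compiled against the old API fails with IncompatibleClassChangeError at run time. If you want to confirm which flavour is on your classpath, something like the following rough sketch may help. The class name ClassKindCheck and the helper kindOf are made up for illustration; they are not part of Hadoop, Pig, or elephant-bird.

```java
public class ClassKindCheck {

    /**
     * Hypothetical helper: reports whether the named type is an
     * "interface" or a "class" on the current classpath, or
     * "unavailable" if it cannot be loaded at all.
     */
    static String kindOf(String className) {
        try {
            // initialize=false: load the type without running its static initializers
            Class<?> c = Class.forName(className, false,
                    ClassKindCheck.class.getClassLoader());
            return c.isInterface() ? "interface" : "class";
        } catch (ClassNotFoundException | LinkageError e) {
            return "unavailable";
        }
    }

    public static void main(String[] args) {
        // Defaults to the type named in the stack trace; pass another name to check it instead
        String name = args.length > 0
                ? args[0]
                : "org.apache.hadoop.mapreduce.JobContext";
        System.out.println(name + " -> " + kindOf(name));
    }
}
```

Assuming the hadoop command is on your PATH, you could compile this and run it with java -cp "$(hadoop classpath):." ClassKindCheck. If it reports an interface while your elephant-bird jar was built against hadoop-0.20.2, that matches the error in the trace.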