Hadoop supports various compression formats, and Snappy is one of them. I created a Snappy-compressed file using the Google Snappy library and used it in Hadoop, but Hadoop failed with an error saying the file is missing the Snappy identifier. After a little research I found a workaround, and the method I followed to find it was as follows.
I compressed the same file twice, once with the Google Snappy library and once with the Snappy codec present in Hadoop, then compared the sizes and checksums of the two outputs. They differ: the file produced by the Hadoop codec is a few bytes larger than the one produced by Google Snappy. The extra bytes are stream metadata (block headers) that Hadoop's codec writes around the compressed data.
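You can reproduce the comparison with a minimal sketch like the one below. It assumes the snappy-java binding (org.xerial.snappy) is on the classpath alongside the Hadoop jars, and that your libhadoop was built with Snappy support; the class name CompareSnappyOutputs is just illustrative.

package com.snappy.codec;

import java.io.ByteArrayOutputStream;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Paths;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.io.compress.CompressionCodec;
import org.apache.hadoop.io.compress.SnappyCodec;
import org.apache.hadoop.util.ReflectionUtils;
import org.xerial.snappy.Snappy;

public class CompareSnappyOutputs {
    public static void main(String[] args) throws Exception {
        byte[] input = Files.readAllBytes(Paths.get(args[0]));

        // Raw block compression with the Google/xerial snappy library
        byte[] raw = Snappy.compress(input);

        // Hadoop's SnappyCodec, which adds its own stream metadata
        CompressionCodec codec = ReflectionUtils.newInstance(
                SnappyCodec.class, new Configuration());
        ByteArrayOutputStream framed = new ByteArrayOutputStream();
        OutputStream out = codec.createOutputStream(framed);
        out.write(input);
        out.close();

        System.out.println("raw snappy bytes   : " + raw.length);
        System.out.println("hadoop codec bytes : " + framed.size());
    }
}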
The code shown below creates a Snappy-compressed file that works perfectly in Hadoop. It requires the following dependent jars, all of which are available in your Hadoop installation.
1) hadoop-common.jar
2) guava-xx.jar
3) log4j.jar
4) commons-collections.jar
5) commons-logging.x.x.x.jar
You can download the code directly from GitHub.
package com.snappy.codec;

/*
 * @author : Amal G Jose
 */
import java.io.BufferedInputStream;
import java.io.BufferedOutputStream;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.InputStream;
import java.io.OutputStream;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.io.compress.CompressionCodec;
import org.apache.hadoop.io.compress.SnappyCodec;
import org.apache.hadoop.util.ReflectionUtils;

/*
 * This program compresses the given file in snappy format.
 */
public class CreateSnappy {
    public static void main(String[] args) {
        if (args.length < 2) {
            System.out.println("Usage: CreateSnappy <input> <output>");
            System.exit(1);
        }
        try {
            // Hadoop's Snappy codec writes the stream metadata that
            // Hadoop expects around the compressed blocks
            CompressionCodec codec = (CompressionCodec) ReflectionUtils
                    .newInstance(SnappyCodec.class, new Configuration());
            OutputStream outStream = codec
                    .createOutputStream(new BufferedOutputStream(
                            new FileOutputStream(args[1])));
            InputStream inStream = new BufferedInputStream(new FileInputStream(
                    args[0]));
            // Copy the input to the compressed output in 64 KB chunks
            int readCount = 0;
            byte[] buffer = new byte[64 * 1024];
            while ((readCount = inStream.read(buffer)) > 0) {
                outStream.write(buffer, 0, readCount);
            }
            inStream.close();
            outStream.close();
            System.out.println("File Compressed");
        } catch (Exception e) {
            e.printStackTrace();
        }
    }
}
Hi Amal
Is there any Hadoop command to convert input data into Snappy-compressed format?
You can use Hive or Pig with compression enabled to convert files to Snappy; a direct command is not available. Alternatively, you can build my program into a jar and use that to convert files to Snappy.
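For example, in Hive you can enable Snappy output before rewriting a table. This is just a sketch: the table names are placeholders, and the exact property names vary slightly across Hadoop/Hive versions.

SET hive.exec.compress.output=true;
SET mapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec;
INSERT OVERWRITE TABLE target_table SELECT * FROM source_table;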
Thanks Amal…
When I run the program I see this error. Any idea on this?
Exception in thread "main" java.lang.NoSuchMethodError: org.apache.hadoop.io.compress.CodecPool.getCompressor(Lorg/apache/hadoop/io/compress/CompressionCodec;Lorg/apache/hadoop/conf/Configuration;)Lorg/apache/hadoop/io/compress/Compressor;
at org.apache.hadoop.io.compress.CompressionCodec$Util.createOutputStreamWithCodecPool(CompressionCodec.java:131)
at org.apache.hadoop.io.compress.SnappyCodec.createOutputStream(SnappyCodec.java:98)
at hive.HiveJdbcClient.main(HiveJdbcClient.java:34)
Seems like this is a library version issue. Which version/distribution of Hadoop are you using?
Amal, I am new to Hadoop. The above code helped me compress a file on the local file system. I want to do the same in HDFS. Could you please share a code snippet?
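Here is a minimal sketch of the same flow against HDFS, assuming the cluster configuration (core-site.xml etc.) is on the classpath; the class name CreateSnappyHdfs is just illustrative.

package com.snappy.codec;

import java.io.InputStream;
import java.io.OutputStream;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IOUtils;
import org.apache.hadoop.io.compress.CompressionCodec;
import org.apache.hadoop.io.compress.SnappyCodec;
import org.apache.hadoop.util.ReflectionUtils;

public class CreateSnappyHdfs {
    public static void main(String[] args) throws Exception {
        if (args.length < 2) {
            System.out.println("Usage: CreateSnappyHdfs <hdfs input> <hdfs output>");
            System.exit(1);
        }
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf);
        CompressionCodec codec = ReflectionUtils.newInstance(SnappyCodec.class, conf);
        // Read the source file from HDFS and write a Snappy-compressed copy back
        InputStream in = fs.open(new Path(args[0]));
        OutputStream out = codec.createOutputStream(fs.create(new Path(args[1])));
        // copyBytes with close=true closes both streams when done
        IOUtils.copyBytes(in, out, conf, true);
        System.out.println("File Compressed");
    }
}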
Can you confirm the versions of the jar files you used? I'm getting the error below:
log4j:WARN No appenders could be found for logger (org.apache.hadoop.util.NativeCodeLoader).
log4j:WARN Please initialize the log4j system properly.
log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
java.lang.RuntimeException: native snappy library not available: this version of libhadoop was built without snappy support.
org.apache.hadoop.io.compress.SnappyCodec.checkNativeCodeLoaded(SnappyCodec.java:65)
org.apache.hadoop.io.compress.SnappyCodec.getCompressorType(SnappyCodec.java:134)
org.apache.hadoop.io.compress.CodecPool.getCompressor(CodecPool.java:150)
org.apache.hadoop.io.compress.CompressionCodec$Util.createOutputStreamWithCodecPool(CompressionCodec.java:131)
org.apache.hadoop.io.compress.SnappyCodec.createOutputStream(SnappyCodec.java:100)
com.snappy.codec.CreateSnappy.main(CreateSnappy.java:35)
Do you have any idea on this?
I took the jars from my CDH4 cluster; I don't remember the exact component versions. The error itself says your libhadoop native library was built without Snappy support, so you need a Hadoop build with the Snappy native library enabled. On newer Hadoop versions you can check what is loaded with hadoop checknative.
Hi Amal,
Could you please explain in detail what Snappy is exactly, how it improves performance, and whether it will also compress jars?