Thanks

]]>I have different approach to build these 256 bit diff encoding where we initialize 1-D matrix, and go on reducing reorganizing for remaining element.

Thanks & regards

Keshav K Shetty

Can you please provide the source that generates these bit-diff codes in unsigned 8bit integers, what I am using right now is your 256Btidff text file and created multi-dimensional array for every number of symbols rack.

for example a function

char get_bitdff_code(int total_symbols, int index);

so that this function returns the code, and we can pick bits from result and populate to output.

I am waiting for your kind response.

bye

]]>Best case happens when data are in descending order. In this case

for first input you need 8 bit = 1×8 = 8bit

for next 128 inputs we need 7 bit each = 128*7=896bits

for next 64 we need 6 bit = 64*6 = 384 and so on

total 1546 bits (or 193.25 bit)

In worst case i.e data are in ascending order

for first 128 inputs we need 8 bit each = 128*8=1024bits

for next 64 we need 7 bit = 64*7 = 448 and so on

total 1793 bits (or 224.125 bit)

Best way to visualize this is by applying diff encoding to the input and see observe bits it generates. (try for smaller data set)

]]>Symbols = 256

Result 1554 bits = 194.25 bytes

Did you mean that one?

]]>1545-1793bits(193.125-224.125bytes)

Thanks

]]>Actually it is my mistake to use wrong word, correct word should non unique value.

When I say unique value, I meant the number (byte) not appeared before in the list, whereas doubles means the number already present in the list.

e.g: lets assume the input bytes are 8, 10, 15, 0, 6, 12, 15, 9, 11, 1, 12, 15

From the list 8, 10, 15, 0, 6, 12, 9, 11, 1 are unique.

Whereas 15,12,15 are doubles or duplicates(non unique).

Note: First 15 and 12 are counted in unique.

First appearance will not treated as non unique.

In the article I mentioned million random digit contains around 90-100 doubles for every block of 256 numbers. That means if it is 90 non unique number then 256-90=166 are unique numbers.

Here on wards I will use the word non unique, instead of doubles or duplicates.

Thanks & regards

Keshav K Shetty