Skip to main content
edited tags
Link
Christophe
  • 81.4k
  • 11
  • 133
  • 200
added 15 characters in body
Source Link
Lance Pollard
  • 2.7k
  • 1
  • 23
  • 41

What I'm wondering is, if there is a way to automatically find the best encoding for the bytes. Automatically find all the sequences that can be cachedput into a dictionary. I don't see how that's not possible, but I imagine it is otherwise it would've been done already. It seems like it would be best solved in the area of DNA sequence analysis.

What I'm wondering is, if there is a way to automatically find the best encoding for the bytes. Automatically find all the sequences that can be cached. I don't see how that's not possible, but I imagine it is otherwise it would've been done already. It seems like it would be best solved in the area of DNA sequence analysis.

What I'm wondering is, if there is a way to automatically find the best encoding for the bytes. Automatically find all the sequences that can be put into a dictionary. I don't see how that's not possible, but I imagine it is otherwise it would've been done already. It seems like it would be best solved in the area of DNA sequence analysis.

added 7 characters in body; edited title
Source Link
Christophe
  • 81.4k
  • 11
  • 133
  • 200

The reason why you can't compress Algorithm for optimizing text like thiscompression

I have seen someam looking for text compression stuffalgorithms (natural language compression, as opposed to arbitrary byte or integerrather than compression of arbitrary binary data), such as .

I have seen for example An Efficient Compression Code for Text DatabasesAn Efficient Compression Code for Text Databases. It sounds like theyThis algorithm basically useuses the words as symbols, createcreates a dictionary from them, and replacereplaces them with integers. So something like this:

Then that would mean the text is turned into:

The reason why you can't compress text like this

I have seen some text compression stuff (natural language compression, as opposed to arbitrary byte or integer compression), such as An Efficient Compression Code for Text Databases. It sounds like they basically use the words as symbols, create a dictionary from them, and replace them with integers. So something like this:

Then that would mean the text is turned into:

Algorithm for optimizing text compression

I am looking for text compression algorithms (natural language compression, rather than compression of arbitrary binary data).

I have seen for example An Efficient Compression Code for Text Databases. This algorithm basically uses the words as symbols, creates a dictionary from them, and replaces them with integers. So something like this:

Then the text is turned into:

deleted 2 characters in body
Source Link
Lance Pollard
  • 2.7k
  • 1
  • 23
  • 41
Loading
Source Link
Lance Pollard
  • 2.7k
  • 1
  • 23
  • 41
Loading