Questions tagged [text-encoding]
The text-encoding tag has no summary.
26 questions
5
votes
9
answers
3k
views
Methods to increase the amount of data sent in a packet
I have been working on launching high-altitude balloons (HABs, or weather balloons) and I have been using LoRa to enable long-range communication with my balloons. It's been great and pretty reliable, ...
-2
votes
1
answer
4k
views
How to identify encoding of a text string? [closed]
I guess most of you have already met them. You get them from your data sources, see them in your logs, or in the output from your legacy systems: strings you can't really read.
To derive any useful ...
2
votes
2
answers
552
views
Compressing EBCDIC file vs UTF8
Today I came across a weird case for which I have no explanation, so here I am.
I have two files with identical content, but one is encoded in UTF-8 and the other one is in IBM EBCDIC. Both of them ...
5
votes
1
answer
382
views
UTF-8 questions
When you encode a code point into UTF-8 code units, if the code point fits in 7 bits, the most significant bit is set to zero, which tells you it is a character which is stored on 1 ...
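As a rough illustration of the leading-bit rule this excerpt describes, the sketch below reads the sequence length from the first byte of a UTF-8 encoded character. The helper name `utf8_length_from_lead_byte` is hypothetical, and the function only inspects the bit pattern; it does not validate the sequence (overlong or out-of-range lead bytes are not rejected).

```python
def utf8_length_from_lead_byte(b: int) -> int:
    """Return the sequence length implied by a UTF-8 lead byte.

    Returns 0 for a continuation byte (10xxxxxx) or an unrecognized pattern.
    Hypothetical helper for illustration; it performs no validation.
    """
    if b & 0x80 == 0x00:  # 0xxxxxxx: code point fits in 7 bits, 1 byte
        return 1
    if b & 0xE0 == 0xC0:  # 110xxxxx: 2-byte sequence
        return 2
    if b & 0xF0 == 0xE0:  # 1110xxxx: 3-byte sequence
        return 3
    if b & 0xF8 == 0xF0:  # 11110xxx: 4-byte sequence
        return 4
    return 0


if __name__ == "__main__":
    for ch in ["A", "ä", "€", "😀"]:
        encoded = ch.encode("utf-8")
        bits = " ".join(f"{byte:08b}" for byte in encoded)
        print(ch, bits, utf8_length_from_lead_byte(encoded[0]))
```

For "A" the single byte starts with a 0 bit, so the reported length is 1; the other three characters report 2, 3 and 4 bytes respectively.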
2
votes
4
answers
3k
views
Reduce number of digits by converting to alphanumeric data
We have an app that receives a web service request, processes it and sends it back to our client by another web service call. There is a unique field in the request, a tracking Id, which currently ...
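One common way to shorten a purely numeric identifier, in the spirit of this question's title, is to re-express it in a larger alphanumeric base such as base 36. The sketch below assumes the tracking ID is a non-negative integer; the function names and the choice of base 36 are illustrative assumptions, not something stated in the question.

```python
import string

# Digits 0-9 followed by A-Z: 36 symbols in total.
ALPHABET = string.digits + string.ascii_uppercase


def to_base36(n: int) -> str:
    """Encode a non-negative integer as an uppercase base-36 string."""
    if n < 0:
        raise ValueError("only non-negative integers are supported")
    if n == 0:
        return "0"
    digits = []
    while n:
        n, remainder = divmod(n, 36)
        digits.append(ALPHABET[remainder])
    return "".join(reversed(digits))


def from_base36(s: str) -> int:
    """Decode a base-36 string back to an integer (int() accepts base 36)."""
    return int(s, 36)


if __name__ == "__main__":
    tracking_id = 12345678901234567890  # made-up 20-digit example value
    short = to_base36(tracking_id)
    print(len(str(tracking_id)), len(short), short)
    assert from_base36(short) == tracking_id
```

Each base-36 character carries about 5.17 bits against about 3.32 bits for a decimal digit, so the encoded form is roughly a third shorter.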
0
votes
1
answer
86
views
Name of data structure where a string determines its value
A few times in my career I've found myself writing decoders for responses from IoT products or weird APIs that, instead of using JSON or XML as a response, reply with something like ...
1
vote
1
answer
77
views
Dictionary Coding for OLAP
Let's say we have a generic table like below:
id, name, price, quantity
20 product_x 5,00 100
20 product_y 5,00 100
20 ...
4
votes
4
answers
4k
views
Windows compatibility with Unix/Linux newline "\n"
A follow-up to Difference between '\n' and '\r\n'.
It's been a few decades since the schism was introduced. Nowadays, when documents are being exchanged over the internet, typically ...
10
votes
1
answer
3k
views
What is the encoding used in Git's binary patches?
Git can generate patches/diffs for binary files as well as for text files.
I'm trying to figure out what encoding it uses for its binary patches.
Here is an example:
diff --git a/www/images/...
-1
votes
2
answers
289
views
How to find charset name by byte codes?
I have a printer and an SDK to work with it in Java. The printer works well with English letters and digits but doesn't correctly print special symbols like 'ä' or 'ê'.
I suppose that I need to convert ...
-1
votes
2
answers
8k
views
Fast and simple hex compression
I'm working on a project that requires a TCP connection between a client and server. The current protocol encodes the data into hex and then sends it. However, hex increases the length of the payload ...
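To make the size overhead concrete, the sketch below compares a raw payload with its hex and Base64 encodings using Python's standard library. The 30-byte payload is made up for illustration, and this is not the asker's protocol; Base64 is shown only as one familiar text-safe alternative.

```python
import base64
import binascii

# Made-up 30-byte binary payload for illustration.
payload = bytes(range(30))

hex_encoded = binascii.hexlify(payload)   # 2 output characters per input byte
b64_encoded = base64.b64encode(payload)   # 4 output characters per 3 input bytes

print(len(payload), len(hex_encoded), len(b64_encoded))  # prints: 30 60 40

# Both encodings are lossless and reversible.
assert binascii.unhexlify(hex_encoded) == payload
assert base64.b64decode(b64_encoded) == payload
```

Hex doubles the payload, while Base64 adds about one third; sending the raw bytes avoids the overhead entirely when the transport is binary-safe, as a TCP stream is.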
2
votes
1
answer
1k
views
What type of encoding does Wikipedia use?
I have the Wikipedia data dump and am trying to decode special characters in the page titles, except a lot of characters don't match up with the "standard" ASCII encoding (referencing from here.)
As an ...
4
votes
2
answers
389
views
Why does ISO 8859-1 contain letter-free diacritics?
ISO 8859-1 contains a few letter-free diacritics: The diaeresis (¨), the acute accent (´), the cedilla (¸) and the macron (¯).¹
Why were they included? As far as I know (please correct me if I am ...
7
votes
2
answers
10k
views
How is encoding handled correctly during copy-paste between programs?
Suppose
a program A opens a text file A using encoding A to decode the file, and
a program B opens a text file B using encoding B.
When we copy some text from file B in program B to file A in ...
1
vote
2
answers
67
views
cross-application text formatting
I am not sure whether this question is a good fit for this site, but if it is not, please let me know and I will take it down.
If it is off-topic, some general info on where I can look for these ...