2023-10-13 10:59:44 +00:00
|
|
|
given a string A, we want it to be B.
|
|
|
|
|
|
|
|
A: hello world and thanks for the fish.
|
|
|
|
B: hello cruel o_o world and thanks for the fish!
|
|
|
|
|
2023-10-23 13:25:03 +00:00
|
|
|
however, the instructions to reconstruct B must be as small as possible, to minimize transmission costs.
|
|
|
|
this is why we dont transmit B entirely.
|
2023-10-13 10:59:44 +00:00
|
|
|
|
|
|
|
different algorithms as follow:
|
|
|
|
|
|
|
|
## ALGORITHM 1
|
|
|
|
|
|
|
|
- identify the first mismatching character (S).
|
|
|
|
|
2023-10-23 13:25:03 +00:00
|
|
|
v-- S@6
|
2023-10-13 10:59:44 +00:00
|
|
|
A: hello world and thanks for the fish.
|
|
|
|
B: hello cruel o_o world and thanks for the fish!
|
|
|
|
|
|
|
|
- identify the last mismatch character (E).
|
|
|
|
|
2023-10-23 13:25:03 +00:00
|
|
|
v-- E@-0
|
2023-10-13 10:59:44 +00:00
|
|
|
A: hello world and thanks for the fish.
|
|
|
|
B: hello cruel o_o world and thanks for the fish!
|
|
|
|
|
|
|
|
- we construct the patch now with 3 numbers:
|
2023-10-23 13:25:03 +00:00
|
|
|
- number of bytes to copy from A[0] till A[S] (not included)
|
|
|
|
- number of bytes to copy from B[S] till B[E] (not included) + plus the substring that is get copied.
|
|
|
|
- number of bytes to copy from A[E] till end of A string.
|
2023-10-13 10:59:44 +00:00
|
|
|
|
|
|
|
6 40 "cruel o_o world and thanks for the fish!" 0
|
|
|
|
|
2023-10-23 13:25:03 +00:00
|
|
|
- total patch size is 3 control bytes [6,40,0] + 40 (string) = 43 bytes
|
2023-10-13 10:59:44 +00:00
|
|
|
|
|
|
|
## ALGORITHM 2
|
|
|
|
|
2023-10-23 13:25:03 +00:00
|
|
|
- We delta every character in both strings, from head to tail, and from tail to head.
|
2023-10-13 10:59:44 +00:00
|
|
|
|
|
|
|
A: hello world and thanks for the fish.0000000000
|
2023-10-23 13:25:03 +00:00
|
|
|
B: hello cruel l_o world and thanks for the fish!
|
2023-10-13 10:59:44 +00:00
|
|
|
C: 000000XXXXX0XXX0XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX (-)
|
|
|
|
|
|
|
|
A: 0000000000hello world and thanks for the fish.
|
2023-10-23 13:25:03 +00:00
|
|
|
B: hello cruel l_o world and thanks for the fish!
|
|
|
|
C: XXXXXXXXXXXX0X0000000000000000000000000000000X (-)
|
|
|
|
|
|
|
|
- Pick the choice with most zeros (2nd choice) (aka, select the choice with less Xs).
|
|
|
|
- On C lane, promote gaps of one or two `0`s into `X`s.
|
|
|
|
|
|
|
|
A: 0000000000hello world and thanks for the fish.\0
|
|
|
|
B: hello cruel l_o world and thanks for the fish!\0
|
|
|
|
C: XXXXXXXXXXXXXX0000000000000000000000000000000X 0 (-)
|
|
|
|
^
|
|
|
|
|
|
|
|
- Do the number of leading Xs(C) - number of leading 0s(A) (14-10). If positive, write it [4].
|
|
|
|
This number will be needed in the patch function.
|
|
|
|
- Do { Encode every XXXX island in C as a positive number indicating how many bytes to copy from.
|
|
|
|
Otherwise, run-length the number of zeros into a negative number.
|
|
|
|
} repeat till lane is exhausted.
|
|
|
|
|
|
|
|
4
|
|
|
|
XXXXXXXXXXXXXXX: +14 "hello cruel l_"
|
|
|
|
0000000000000000000000000000000: -31
|
2023-10-13 10:59:44 +00:00
|
|
|
X: +1 "!"
|
|
|
|
|
2023-10-23 13:25:03 +00:00
|
|
|
- Patch size is 4 control bytes [4,+14,-31,+1] + 14 (string) + 1 (string) = 19 bytes
|
2023-10-13 10:59:44 +00:00
|
|
|
|
|
|
|
## ALGORITHM 3
|
|
|
|
|
|
|
|
- start source (A) and target (B) strings. start with A:
|
|
|
|
- select how many bytes to copy (positive), how many to skip (negative), or switch A<-->B lanes (zero).
|
|
|
|
- repeat previous step over and over until both lanes do reach their respective string ends.
|
|
|
|
|
|
|
|
A: hello world and thanks for the fish.
|
|
|
|
B: hello cruel o_o world and thanks for the fish!
|
|
|
|
|
|
|
|
A: +6 0 >> hello
|
|
|
|
B: -6 +10 0 >> hello cruel o_o
|
|
|
|
A: +24 -1 0 >> hello cruel o_o world and thanks for the fish
|
|
|
|
B: -29 +1 >> hello cruel o_o world and thanks for the fish!
|
|
|
|
(10 bytes)
|
|
|
|
|
|
|
|
what if we always specify a dual +/- operation per lane?
|
|
|
|
lanes changed automatically after every tuple. in this case a 0 shall indicate a no op.
|
|
|
|
|
|
|
|
A: +6 0 >> hello
|
|
|
|
B: -6 +10 >> hello cruel o_o
|
|
|
|
A: +24 -1 >> hello cruel o_o world and thanks for the fish
|
|
|
|
B: -29 +1 >> hello cruel o_o world and thanks for the fish!
|
|
|
|
(8 bytes)
|
|
|
|
|
|
|
|
- we construct the patch now. a substring excerpt must come after any positive operations on lane B.
|
|
|
|
|
|
|
|
+6 0 -6 +10 "cruel o_o " +24 -1 -29 +1 "!"
|
|
|
|
|
|
|
|
- total patch size is 8 control bytes + 10 (string) + 1 (string) = 19 bytes
|