base2048 — screenshot of github.com

base2048

Base2048 is a binary encoding designed to maximize data transmission over Twitter, leveraging its weighted character limits. It allows me to fit up to 385 octets in a single tweet, outperforming older methods like Base65536.

Visit github.com →

Questions & Answers

What is Base2048?
Base2048 is a binary encoding scheme and its JavaScript implementation, designed to efficiently transmit binary data through Twitter. It leverages Twitter's character limits to embed more data than other common encodings.
Who would use Base2048?
Base2048 is for developers or users who need to transmit small amounts of binary data, such as encrypted messages or small files, through Twitter posts while maximizing the data payload within a single tweet's character limits.
How does Base2048 compare to other binary encodings for Twitter?
Base2048 utilizes "light" Unicode characters, allowing up to 385 octets (bytes) per tweet, compared to Base65536 which uses "heavy" characters and thus only accommodates 280 octets in the same 280-character tweet limit. This makes it more efficient for the current Twitter character weighting.
When is Base2048 most useful?
Base2048 is most useful when the goal is to transmit the maximum possible binary data payload within Twitter's 280-character limit, especially considering Twitter's weighted character count that penalizes "heavy" Unicode characters.
What kind of characters does Base2048 use?
Base2048 exclusively uses "light" Unicode code points, specifically 2048 carefully selected "safe" characters (e.g., no control characters, whitespace, or combining diacritics) from the U+0000 to U+10FF range, which Twitter counts as single characters. This allows for higher data density per tweet.