This line has a sneaky Unicode dash – right here.
This line has curly quotes “like these”.
This line has a non-breaking space between words.
type unicode2ascii.bat
@echo off
:: unicode2ascii.bat
:: This batch file runs a PowerShell script that removes all non-ASCII
:: characters from unicode.txt and writes the cleaned output to ascii.txt.
powershell -NoProfile -ExecutionPolicy Bypass -File unicode2ascii.ps1
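
The PowerShell script itself is not shown in the post, so the following is only a rough Python sketch of the same idea, not a copy of it. It goes one step beyond plain removal: a small, hypothetical `REPLACEMENTS` table maps common typographic characters to ASCII first, and anything still outside ASCII is then dropped. The function names are illustrative assumptions; the file names `unicode.txt` and `ascii.txt` are taken from the batch file comments.

```python
# Hypothetical replacement table: common typographic characters and the
# ASCII text to use instead. Extend it to suit your own input.
REPLACEMENTS = {
    "\u2013": "-",   # en dash
    "\u2014": "-",   # em dash
    "\u2018": "'",   # left single quotation mark
    "\u2019": "'",   # right single quotation mark
    "\u201c": '"',   # left double quotation mark
    "\u201d": '"',   # right double quotation mark
    "\u00a0": " ",   # non-breaking space
}


def to_ascii(text: str) -> str:
    """Map known characters to ASCII, then drop anything still non-ASCII."""
    for src, dst in REPLACEMENTS.items():
        text = text.replace(src, dst)
    return "".join(ch for ch in text if ord(ch) < 128)


def convert_file(src: str = "unicode.txt", dst: str = "ascii.txt") -> None:
    """Read src as UTF-8 and write the cleaned text to dst as ASCII."""
    with open(src, encoding="utf-8") as fin:
        cleaned = to_ascii(fin.read())
    with open(dst, "w", encoding="ascii", newline="") as fout:
        fout.write(cleaned)
```

Mapping before stripping keeps a dash or a quote readable instead of silently deleting it; characters with no entry in the table (accented letters, for instance) are still removed.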
Bear in mind there is much more to pasted web-page text than just
"Unicode characters", as Unicode is only the container; the real trouble
comes from the variety of characters inside it, such as zero-width
spaces & joiners, directional control characters, soft hyphens, etc.
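
A minimal sketch of how such invisible characters might be found, assuming it is enough to flag everything in Unicode general category `Cf` (format), which covers zero-width spaces and joiners, the directional controls, and the soft hyphen. The function names `find_invisible` and `strip_invisible` are illustrative, not part of any tool mentioned in this thread.

```python
import unicodedata


def find_invisible(text: str) -> list[tuple[int, str, str]]:
    """Return (index, code point, name) for each format-category character."""
    hits = []
    for i, ch in enumerate(text):
        if unicodedata.category(ch) == "Cf":
            name = unicodedata.name(ch, "UNKNOWN")
            hits.append((i, f"U+{ord(ch):04X}", name))
    return hits


def strip_invisible(text: str) -> str:
    """Remove every format-category character from the text."""
    return "".join(ch for ch in text if unicodedata.category(ch) != "Cf")


if __name__ == "__main__":
    sample = "zero\u200bwidth soft\u00adhyphen \u200eltr"
    for index, code_point, name in find_invisible(sample):
        print(index, code_point, name)
    print(strip_invisible(sample))
```

Reporting before stripping is deliberate: zero-width joiners are meaningful in emoji sequences and in some scripts, so removing them blindly can change how legitimate text renders.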
On 12/31/2025 7:21 PM, Marian wrote:
> Bear in mind there is much more than just Unicode characters in
> pasted web-page text as Unicode is only the container; the real trouble
> comes from the variety of characters inside it such as zero-width
> spaces & joiners, directional control characters, soft hyphens, etc.
I don't understand the problem. These days (nearly) every web page
and Usenet posting uses UTF-8 character encoding. Your posting
uses UTF-8 too:
Content-Type: text/plain; charset=UTF-8; format=flowed
User-Agent: tin/1.6.2-20030910 ("Pabbay") (UNIX) (CYGWIN_NT-10.0-WOW/2.8.0(0.309/5/3) (i686)) Hamster/2.0.2.2
There shouldn't be any problem when you copy text from a web page and
paste it into a Usenet posting. There is only a problem if you
convert the UTF-8 text into "something else" and then paste it
into the posting. You don't solve a problem, you create a problem.
Just post a link to a web page that has a problem when you copy and
paste its text into a Usenet posting.
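
The claim that the conversion, not the copy and paste, causes the damage can be checked with a small sketch. It assumes the "something else" is plain ASCII with replacement on errors, which is only one possible lossy conversion; the function names are illustrative.

```python
def round_trip_utf8(text: str) -> str:
    """Encode to UTF-8 and decode again; nothing is lost."""
    return text.encode("utf-8").decode("utf-8")


def force_ascii(text: str) -> str:
    """Encode to ASCII, substituting '?' for every unencodable character."""
    return text.encode("ascii", errors="replace").decode("ascii")


if __name__ == "__main__":
    sample = "curly quotes \u201clike these\u201d and a dash \u2013 here"
    print(round_trip_utf8(sample) == sample)  # UTF-8 keeps every character
    print(force_ascii(sample))                # typographic characters become '?'
```

Text that stays UTF-8 from web page to posting survives unchanged, while the forced ASCII path turns each curly quote and dash into a question mark.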