On Thu, 1 Feb 2024 14:17:10 +0100, "Carlos E. R." <robin.listas@telefonica.net> wrote:
On 2024-02-01 13:52, Robert Webb via openSUSE Users wrote:
On Thu, 1 Feb 2024 12:39:40 +0100 (CET), "Carlos E. R." <robin.listas@telefonica.net> wrote:
On Thursday, 2024-02-01 at 11:16 -0000, Robert Webb via openSUSE Users wrote:
On Thu, 1 Feb 2024 09:12:45 +0000, Dave Howorth <dave@howorth.org.uk> wrote:
On Thu, 1 Feb 2024 00:53:24 -0600 -pj via openSUSE Users <users@lists.opensuse.org> wrote:
On 02-01-2024 12:31AM, David T-G wrote: > Carlos & PJ, et al -- > [...]
...
I don't see the error in "my" man page, sorry.
Right. Instead of hex "2d", as you properly have, I see the UTF-8 sequence "e2 80 90" (Unicode U+2010 Hyphen) in "my" man page.
What do you think of this?: $ man susepaste |col -bx |grep -o -e '-' |wc -l 1
Yes, there is just one hyphen-minus in the whole man page. It is in the title line.
For me: $ locale LANG=en_US.UTF-8 LC_CTYPE="en_US.UTF-8" LC_NUMERIC="en_US.UTF-8" LC_TIME="en_US.UTF-8" LC_COLLATE=C LC_MONETARY="en_US.UTF-8" LC_MESSAGES="en_US.UTF-8" LC_PAPER="en_US.UTF-8" LC_NAME="en_US.UTF-8" LC_ADDRESS="en_US.UTF-8" LC_TELEPHONE="en_US.UTF-8" LC_MEASUREMENT="en_US.UTF-8" LC_IDENTIFICATION="en_US.UTF-8" LC_ALL=
cer@Telcontar:~> locale LANG=en_US.UTF-8 LC_CTYPE=en_US.UTF-8 LC_NUMERIC=es_ES.utf8 LC_TIME=en_DK.UTF-8 LC_COLLATE=POSIX LC_MONETARY=es_ES.utf8 LC_MESSAGES="en_US.UTF-8" LC_PAPER=es_ES.utf8 LC_NAME=es_ES.utf8 LC_ADDRESS="en_US.UTF-8" LC_TELEPHONE=es_ES.utf8 LC_MEASUREMENT=es_ES.utf8 LC_IDENTIFICATION="en_US.UTF-8" LC_ALL= cer@Telcontar:~>
I'm surprised that your LC_CTYPE value is the same as mine. Isn't that the one that should determine character encodings?
It must be something else. I am using Leap 15.4 on this machine.
Oh, and some of us may have installed susepaste from a different repo. I know I have tried several versions.
Telcontar:~ # zypper info susepaste Loading repository data... Reading installed packages...
Information for package susepaste: ---------------------------------- Repository : Update repository of openSUSE Backports Name : susepaste Version : 0.6-bp154.3.3.1 Arch : noarch Vendor : openSUSE Installed Size : 41.8 KiB Installed : Yes Status : up-to-date Source package : susepaste-0.6-bp154.3.3.1.src Upstream URL : https://susepaste.org Summary : Script for using openSUSE paste Description : A script for using the openSUSE paste service. You can paste either the file or input from stdin.
Telcontar:~ #
I think that is current the official version.
And mine on Tumbleweed: $ zypper if susepaste [...] Information for package susepaste: ---------------------------------- Repository : Main Repository (OSS) Name : susepaste Version : 0.6-7.1 Arch : noarch Vendor : openSUSE Installed Size : 41.8 KiB Installed : Yes (automatically) Status : up-to-date Source package : susepaste-0.6-7.1.src Upstream URL : https://github.com/openSUSE/paste Summary : Script for using openSUSE paste Description : A script for using the openSUSE paste service. You can paste either the file or input from stdin.
When I tried setting LC_ALL=C, the man page was OK (using hex 2d). OK also with LC_CTYPE=C .
I only noticed the problem recently, with not just the susepaste man page BTW, so maybe there is a difference between Leap and Tumbleweed (which I am using). Some man pages are OK, though, and some use mixed hyphen-minus characters in some places, and then semi-randomly other "hyphens" elsewhere (man rsync).
Maybe you should write a bugzilla. My guess is that the source of the manpage does have the "wrong" hyphen, but depending on something there is a conversion to the correct hyphen.
# zcat /usr/share/man/man1/susepaste.1.gz | hexdump -C | less
I see the 2d character there. I don't think the above does any conversion, but troff/groff maybe does.
The same. I only see the hyphen-minus character ('-' U+002D) and no hyphens ('‐' U+2010) in the source nroff file. $ for UCODE in 002D 2010 ;do eval "CHAR=$'\\u$UCODE'" ;printf '=== U+%s: %s: ' "$UCODE" "$CHAR" zcat /usr/share/man/man1/susepaste.1.gz |grep -o -e "$CHAR" |wc -l done === U+002D: -: 26 === U+2010: ‐: 0 -- Robert Webb