Skip to main content

You are not logged in. Your edit will be placed in a queue until it is peer reviewed.

We welcome edits that make the post easier to understand and more valuable for readers. Because community members review edits, please try to make the post substantially better than how you found it, for example, by fixing grammar or adding additional resources and hyperlinks.

Required fields*

5
  • For most locales it ultimately comes from the LC_CTYPE stuff in (with glibc) /usr/share/i18n/locales/i18n ... which of course comes largely from the Unicode Character Database. Of course, it would be nice to have a command Commented May 6, 2014 at 21:11
  • @derobert, yes, while locale (at least the GNU one) retrieves many of the informations stored in many of the categories, things it doesn't are the most important ones in LC_CTYPE and LC_COLLATE. I wonder if there's a hidden API to retrieve that information or uncompile the locale information. Commented May 6, 2014 at 22:01
  • Yeah - you can get that info parsed - I just finally got around to wrapping up my edit. There are several commands you probably already have installed - at least I did, and I didn't even know about them. I hope it helps. Specifically recode and uconv can give you what you what you say you're looking for. Possibly even just luit and od I guess... Commented May 7, 2014 at 2:35
  • Thats very good! That means you dont need perl at all, i think. Commented May 7, 2014 at 16:17
  • I seem to be able to basically extract my charset from LC_CTYPE with just od -A n -t c <LC_CTYPE | tsort Probably you've tried it already, but I'd never heard of it before and I was reading through info and it reminded me of this - and it seems to work. There's also ptx but I think it's less relevant. Anyway, if you haven't tried it and decide to do so - fair warning - it does require a little patience. lehman.cuny.edu/cgi-bin/man-cgi?tsort+1 Commented May 13, 2014 at 4:08