Bitcoin Forum
June 12, 2026, 05:42:26 PM *
News: Latest Bitcoin Core release: 31.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: [1]
  Print  
Author Topic: BIP39 multilingual display wordlists, looking for review and native-speaker feed  (Read 73 times)
daniosem (OP)
Newbie
*
Offline

Activity: 51
Merit: 0


View Profile
June 08, 2026, 05:43:19 PM
 #1

Hi everyone,

I wanted to share an open-source contribution for BIP39 recovery UX:

https://github.com/osem23/bip39-wordlists-tzur

This is not a new seed standard, not a wallet promotion, and not a replacement for the existing BIP39 wordlists.

The goal is to make BIP39 recovery more accessible to users who are not comfortable with English.

Many Bitcoin users around the world still have to write down and recover seed phrases in English, even when they do not understand English well.

For English speakers, seed words are familiar words.

For many non-English users, they can feel like random foreign symbols that must be copied perfectly.

This project provides native-language display and input wordlists for BIP39 recovery UX.

Important clarification:

The canonical BIP39 seed flow remains unchanged.

The English BIP39 mnemonic remains the source of truth.

PBKDF2 still runs on the canonical BIP39 form.

The native-language words are an optional display/input layer, not a new cryptographic scheme.

The repo includes wordlists and mappings for additional languages, with the goal of making recovery easier to understand for users who are not comfortable with English.

I understand that the original BIP39 wordlists were not simple direct translations. They were designed with important constraints such as avoiding confusing words, avoiding multiple spellings, avoiding similar-sounding words, and supporting short-prefix identification where possible.

Feedback is very welcome, especially around:

Are native-language display/input layers useful for wallet recovery UX?

What risks does this approach create?

Should something like this stay outside BIP39 as a wallet-specific UX layer?

Are there languages where this approach is especially risky?

How should native-speaker review be handled?

Are there better ways to help non-English users recover safely without changing BIP39?

I would especially appreciate review from wallet developers, BIP39 implementers, and native speakers of the included languages.

GitHub:
https://github.com/osem23/bip39-wordlists-tzur

Thank you.
ABCbits
Legendary
*
Offline

Activity: 3640
Merit: 10105



View Profile
June 09, 2026, 07:52:40 AM
 #2

Feedback is very welcome, especially around:

Some language such as Korean already exist on BIP 39 itself. When i compare the word list between actual BIP 39 and yours, it's different.

https://github.com/bitcoin/bips/blob/master/bip-0039/korean.txt
https://github.com/osem23/bip39-wordlists-tzur/blob/main/wordlists/tzur-original/korean.txt

Personally i would recommend to remove all language that already exist on BIP 39 itself.

What risks does this approach create?

People could make wrong assumption recovery words generated by your Tzur wallet can be used on other wallet.

███████████████████████████
███████▄████████████▄██████
████████▄████████▄████████
███▀█████▀▄███▄▀█████▀███
█████▀█▀▄██▀▀▀██▄▀█▀█████
███████▄███████████▄███████
███████████████████████████
███████▀███████████▀███████
████▄██▄▀██▄▄▄██▀▄██▄████
████▄████▄▀███▀▄████▄████
██▄███▀▀█▀██████▀█▀███▄███
██▀█▀████████████████▀█▀███
███████████████████████████
.
.Duelbits PREDICT..
█████████████████████████
█████████████████████████
███████████▀▀░░░░▀▀██████
██████████░░▄████▄░░████
█████████░░████████░░████
█████████░░████████░░████
█████████▄▀██████▀▄████
████████▀▀░░░▀▀▀▀░░▄█████
██████▀░░░░██▄▄▄▄████████
████▀░░░░▄███████████████
█████▄▄█████████████████
█████████████████████████
█████████████████████████
.
.WHERE EVERYTHING IS A MARKET..
█████
██
██







██
██
██████
Will Bitcoin hit $200,000
before January 1st 2027?

    No @1.15         Yes @6.00    
█████
██
██







██
██
██████

  CHECK MORE > 
bitmover
Legendary
*
Online Online

Activity: 3066
Merit: 7498


Trêvoid █ No KYC-AML Crypto Swaps


View Profile WWW
June 09, 2026, 02:35:41 PM
 #3

There are already other wordlists of different languages

Portuguese community of this forum even made the Portuguese bip39 words in 2020, which was merged into the bitcoin github

https://bitcointalk.org/index.php?topic=5273606.msg55130902#msg55130902

▄▄████████████████████▄▄
▄███████▀▀██████▀▀███████▄
████████████████████████
████████▄▄██████▄▄██████

████████████████████████
██▄▄█████████████▄▄██████
██▀▀██████████████████▄▄██
██████▀▀██████████████▀▀██
██████████████████████████
██████▀▀██████▀▀████████
████████████████████████
▀███████▄▄██████▄▄███████▀
▀▀████████████████████▀▀
 
 DΞX.fo 
▄▄██████
█████████
██████████
█████████
██████████
█████████
▀▀██████

▄███████
▄██████████
████████████
█████████████
█████████████
|
▄▄█
▄████▀
▄███▀
▄██▀▄██
█████▀▀
███████
████████
▀██▄████
▄████▄▄
▄█████▀███
▄█████▀████
█████▀███████
▀██▀█████████
|  BTC     XMR  
  DAI     LTC  
   Fees  0.8%    
nc50lc
Legendary
*
Offline

Activity: 3178
Merit: 8850


Self-proclaimed Genius


View Profile
June 10, 2026, 03:54:23 AM
 #4

The English BIP39 mnemonic remains the source of truth.
Perhaps this is a mistranslation?
Because "source of truth" sounds nothing like a technical term.

Do you mean the "entropy"?
So, in other words, the mnemonic remains in English when used to derive the binary seed.
Your implementation just maps it to other unsupported languages for native language backup, but everything in the background follows BIP39 spec, correct?

Quote from: daniosem
Should something like this stay outside BIP39 as a wallet-specific UX layer?
The main problem that I'm seeing is if it's wallet-specific:
Non-tech savvy users might rely too much on the "translated mnemonic" as a backup and will come into issue when they try to restore it to others wallets that did not implement this.
Even though BIP39 is still used in the background, the translated mnemonic itself will cause incompatibilities if it's not implemented by the majority of the wallets.
So something like this should be an update to BIP39 standard rather than wallet-specific.

But I think updating the BIP39 word list to accommodate other languages should be enough if updating BIP39 is an option.
That way, it wont need any mapping and just derives that binary seed directly from the normalized local characters.

███████████████████████████
███████▄████████████▄██████
████████▄████████▄████████
███▀█████▀▄███▄▀█████▀███
█████▀█▀▄██▀▀▀██▄▀█▀█████
███████▄███████████▄███████
███████████████████████████
███████▀███████████▀███████
████▄██▄▀██▄▄▄██▀▄██▄████
████▄████▄▀███▀▄████▄████
██▄███▀▀█▀██████▀█▀███▄███
██▀█▀████████████████▀█▀███
███████████████████████████
.
.Duelbits PREDICT..
█████████████████████████
█████████████████████████
███████████▀▀░░░░▀▀██████
██████████░░▄████▄░░████
█████████░░████████░░████
█████████░░████████░░████
█████████▄▀██████▀▄████
████████▀▀░░░▀▀▀▀░░▄█████
██████▀░░░░██▄▄▄▄████████
████▀░░░░▄███████████████
█████▄▄█████████████████
█████████████████████████
█████████████████████████
.
.WHERE EVERYTHING IS A MARKET..
█████
██
██







██
██
██████
Will Bitcoin hit $200,000
before January 1st 2027?

    No @1.15         Yes @6.00    
█████
██
██







██
██
██████

  CHECK MORE > 
daniosem (OP)
Newbie
*
Offline

Activity: 51
Merit: 0


View Profile
June 10, 2026, 05:58:17 PM
Last edit: June 10, 2026, 09:23:38 PM by achow101
 #5

Feedback is very welcome, especially around:

Some language such as Korean already exist on BIP 39 itself. When i compare the word list between actual BIP 39 and yours, it's different.

https://github.com/bitcoin/bips/blob/master/bip-0039/korean.txt
https://github.com/osem23/bip39-wordlists-tzur/blob/main/wordlists/tzur-original/korean.txt

Personally i would recommend to remove all language that already exist on BIP 39 itself.

What risks does this approach create?

People could make wrong assumption recovery words generated by your Tzur wallet can be used on other wallet.

Thank you, this is a good point.

You are right that Korean already exists in the official BIP-39 wordlists, and using a different TZUR-specific Korean list can create unnecessary confusion.

The goal of TZUR’s multilingual layer is not to replace BIP-39 or create a new seed standard. The canonical recovery phrase remains the standard English BIP-39 mnemonic. The localized words are only a deterministic display layer mapped back to the English BIP-39 indexes.

I agree that for languages already included in official BIP-39, we should either use the official list or remove the custom version. That would reduce confusion and make compatibility expectations clearer.

The main risk is exactly what you mentioned: users may assume the localized words can be restored in any wallet. This needs to be clearly explained in the UI and documentation. The English BIP-39 phrase is the portable backup. The localized display should be treated as wallet-specific unless it uses an official BIP-39 list.

I appreciate the feedback and will review the existing language lists accordingly.



The English BIP39 mnemonic remains the source of truth.
Perhaps this is a mistranslation?
Because "source of truth" sounds nothing like a technical term.

Do you mean the "entropy"?
So, in other words, the mnemonic remains in English when used to derive the binary seed.
Your implementation just maps it to other unsupported languages for native language backup, but everything in the background follows BIP39 spec, correct?

Quote from: daniosem
Should something like this stay outside BIP39 as a wallet-specific UX layer?
The main problem that I'm seeing is if it's wallet-specific:
Non-tech savvy users might rely too much on the "translated mnemonic" as a backup and will come into issue when they try to restore it to others wallets that did not implement this.
Even though BIP39 is still used in the background, the translated mnemonic itself will cause incompatibilities if it's not implemented by the majority of the wallets.
So something like this should be an update to BIP39 standard rather than wallet-specific.

But I think updating the BIP39 word list to accommodate other languages should be enough if updating BIP39 is an option.
That way, it wont need any mapping and just derives that binary seed directly from the normalized local characters.

Thank you, that is a good clarification.

You are right that “source of truth” was not the most precise technical wording.

What I meant is:

The wallet generates standard BIP-39 entropy.
That entropy is encoded into the standard English BIP-39 mnemonic.
The English mnemonic is then used normally to derive the binary seed according to the BIP-39 spec.
The localized words are only a deterministic display layer mapped by index to the English BIP-39 wordlist.

So yes, everything in the background still follows BIP-39. The localized phrase is not used directly as the BIP-39 mnemonic unless that language is already an official BIP-39 wordlist and the implementation explicitly supports it.

I also agree with your concern about wallet-specific UX. The biggest risk is that non-technical users may treat the translated phrase as a universal backup and later try to restore it in another wallet that does not support the same mapping. That would create a bad recovery experience, even if the underlying English BIP-39 phrase is still valid.

That is why I am leaning toward one of two safer approaches:

1. For languages that already exist in official BIP-39, use the official list only.
2. For languages not included in BIP-39, clearly label the localized phrase as a TZUR display layer and always provide/export the standard English BIP-39 recovery phrase as the portable backup.

I understand your point that if this is meant to be broadly interoperable, it should ideally be standardized rather than wallet-specific. A wallet-specific mapping can improve UX, but it cannot create ecosystem compatibility by itself.

Updating or extending BIP-39 wordlists would be the cleaner interoperability path, because wallets could derive the seed directly from the normalized local-language mnemonic without requiring a separate mapping layer. The hard part is that such lists need very careful native review, uniqueness rules, normalization rules, typo resistance, and broad wallet adoption.

So for now, I think the safest framing is:

This should not be presented as “translated BIP-39” unless it is actually using an official BIP-39 wordlist.
It should be presented as a localized display layer, with the English BIP-39 mnemonic remaining the portable recovery phrase.

I appreciate the correction. I will adjust the wording to be more technically accurate.

Mod note: Consecutive posts merged
Pages: [1]
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!