-
Notifications
You must be signed in to change notification settings - Fork 39
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
chore: better suggestions for blacklist / whitelist #45
base: main
Are you sure you want to change the base?
Conversation
@Nytelife26 Thanks for raising this. It's good to get the perspective of folks beyond just English speakers. One question though: It wasn't entirely clear to me from the linked issue, are you advocating for:
or were those intended to be illustrative? |
@Nytelife26 Apologies... I parsed this first time through as an Issue and not a PR, and thus didn't look at the actual change you proposed, which is clearer :) |
Mostly illustrative. Solutions for this are not yet concrete, but it is clear to see that the ones decided upon currently are not suitable, and so we need new ones. Those solutions sacrifice brevity for clarity, which shouldn't be an issue, but I believe people may encounter issues with the verbosity. Any better ideas are always welcome and appreciated, so long as they are not more harmful than good.
No worries! Always glad to see people asking questions - it means they're paying attention and willing to see from the other side. |
Sorry, didn't mean to comment inline with the specific change being discussed...
Well, any example would ultimately have something to do with cases where the author of the material (or the interface/mechanism being described) for some reason has a sort of double-negative situation. So, if the default behavior is to prevent access except in special cases... One other dimension to this discussion is the distance between the two (opposing) words. Ever get tired of telephone menu systesm that say hit Am wondering the same thing about these word pairings. |
That depends largely on context. Ultimately, that's the point of calling them "inclusion list" and "exclusion list" - they are inclusive or exclusive of an entity from any given situation at hand, which is exactly what "whitelist" and "blacklist" meant, only clearer.
I am not entirely sure, actually. The I cannot weigh in on dyslexia though, as I do not have it, and I fear that speaking to people I know that do have it will result in the "{x} people do not speak for everyone" argument. "Authorization list" and "prohibition list" would work fine even if not, but be less accessible to those of lower English proficiency - although, at least translatable. |
Ok, that make sense.
While I am inclined to believe that, I am not a linguistics/hearing subject matter expert. So, I wonder if we have better metrics than just our own (potentially biased) sense of things. But, this does raise a key point, for me anyways...that I've only been thinking in terms of written word and the truth is, people will also need to speak and hear these words and we should be operating with some sensitivity to that as well. Up until this dialog, I had not.
I have number dyslexia 😄 (which I think is now called dyscalculia) and word dyslexia when I am very fatigued. |
Unfortunately, I'm not entirely certain how we could do that without conducting, to some extent, a study on the matter via experiment and statistical evaluation. As far as my knowledge of linguistics go, it should be fine, but maybe you're right, it might be better to measure these things up.
Interesting point. Maybe I should add that into the accessibility metric of my evaluation framework.
In your experience, would you find the two words hard to distinguish? Once again, I know you don't speak for everybody with the condition, but I'm just curious. All the best to you with that though - my father has them too and I help him proofread his writing. |
For written word, I think my experience of confusing |
Outside of that, possibly narrow, situation, I think the risk of
spoken-word confusion goes up to at least moderate and maybe even high.
That's an interesting roadblock. I may have to trial this.
Although, under these other circumstances, is it not also likely that
the sentences will be misheard altogether? Perhaps this is my limited
view, but I am not sure how `ex` can be misheard as `in` or vice versa
unless the sound is ommitted in hearing altogether, at which point of
course one would need to request that the speaker repeats themselves.
If this is an issue we may need to incorporate spoken considerations
into our suggestions.
As aforementioned, though, these are not concrete guidelines. If
necessary when vocalizing such phrases people are welcome to use one of
the other alternatives, such as the more concise "list of banned X" or
"list of allowed X".
If anyone else has any thoughts on this, I would be greatly interested
in hearing them.
|
@Nytelife26 One other thought that occurred to me over the weekend. 'whitelist' and 'blacklist' are also used as verbs meaning 'to add something to the list of things allowed/permitted/included (or denied/excluded)'. Its seems like the conversation so far has been around those terms as nouns... do you have thoughts on them as verbs? |
do you have thoughts on them as verbs?
It occurs to me that the verb usage is actually the easiest part of the
problem - you can easily substitute with "exclude" and "include", or
"allow" and "prohibit", just as some examples. You do not ever need to
use the phrases themselves as verbs.
Does anyone say "mailing-listed", or are you likely to say "You've been
added to the mailing list"?
The same should go here. You should either use an appropriate substitute
verb, or use phrasing similar to "You have been added to the [inclusion
/ exclusion] list".
|
So, I don't wanna create roadblocks and sorry if my excursion into these other aspects did. For this PR, its likely best to remove the questions I raised regarding the potential for confusion in various settings and due to various impairments that may be present. That may be somewhat unique to the case of word-pairings intended to have opposite meanings anyways. That said, I do think consideration of those questions as part of the process of rating replacements may lend even more credibility to our final recommendations. |
So, I don't wanna create roadblocks and sorry if my excursion into these
other aspects did.
Oh, no, it's fine, I didn't mean it like that. I'm just saying it's a
very interesting perspective, and one that might add some necessary
complexity to the problem at hand.
Which is fine - if anything, that furthers the need for this change.
may lend even more credibility to our final recommendations
Definitely. We need to consider as much as possible before arriving at
something concrete. For now, though, we have a good set of suggestions.
What's the next move?
|
One thing I am inclined to start doing is gathering together some existing refs on metrics for language. @quaid already mentioned plain language. I think there are things related to measuring distance between words (written and spoken) and I would love to see something that indicates relative (or absolute) probabilities of misinterpretation (due to impairments of one form or another) of terms. If such things exist, it would be good to gather them for review to understand if/how they could be used profitably for our goals. |
Interesting. I'll look into plain language when I get the time.
Ultimately, aside from the criteria I laid out in my statement, we need
to consider interpretability, clarity, and proliferated meanings.
|
Considering translation is really important. I've worked with our translation centers as we've updated terms at IBM, providing new terms with definitions and other explanatory information. We now have a full set of approved equivalents (translated terms) for "allowlist" and "blocklist" in all of the languages we translate to. It's always important to try to choose terms that make sense and can be translated effectively, but providing terms up front with supporting material is also a good way to ensure good translations. Are we going to eventually share translations of our terms? |
Considering translation is really important. I've worked with our
translation centers as we've updated terms at IBM, providing new terms
with definitions and other explanatory information. We now have a full
set of approved equivalents (translated terms) for "allowlist" and
"blocklist" in all of the languages we translate to. It's always
important to try to choose terms that make sense and can be translated
effectively, but providing terms up front with supporting material is
also a good way to ensure good translations.
Are we going to eventually share translations of our terms?
The point here was to highlight the issues with the currently used
alternatives. Having to form new translations makes things more
difficult for both the people working on these projects and the people
using the languages themselves.
I would, however, encourage IBM to release this information so others
can make use of it. Otherwise there's no point having it at all.
Either way, the alternatives we have highlighted are designed with
translation in mind. There has been and will be no need to co-ordinate
new terms in any languages because these terms are directly
translatable.
Part of the issue is that "allowlist" and "blocklist" are not valid
English themselves. If something isn't valid in its host language how
can we translate it to any targets?
Thank you for sharing, and weighing in as an employee under a high
profile enterprise. If IBM has any intention to release its findings
to the public, that would be great.
|
I am not the convinced the valid-english-ness of these examples is either very much true or a very strong argument as a general principle. I worry that so much of tech-industry language is of this ilk that seeking onl valid English replacements may too significantly limit our choices. |
either very much true or a very strong argument as a general principle.
You will notice something about compound nouns in the verb + noun form
\- they are all separated by a space (two words, rather than joined
together) and are almost all use the `-ing` form of the verb. English
does not have many set rules, so it's more about what makes the most
sense.
However, neither "allowlist" nor "blocklist" conform to these standard
principles. In general, they do more harm than good, as stated earlier
in this thread. Even saying "a blocked list" would mean to block the
list rather than its contained items. A *blocking* list, however, might
make sense, but still sounds somewhat unusual.
This is a problem that persists from the original terms themselves, too.
We have the chance to fix it now we are shifting away from the
originals,so we should.
only *valid English* replacements may too significantly limit our choices.
The issue here is that English is not the only language on Earth. By
treating it as such, we create linguistic proficiency barriers for those
of foreign backgrounds and also of lower proficiency in general.
The terms you create in a language should conform to the language they
are intended to be used in. Not following that both creates unusual and
invalid constructs in the host language, but also makes them impossible
to translate without arranging replacements in target languages, as
kindly reinforced by Carla.
|
Not impossible to translate, but we like to establish and record equivalents for terms so that we can have consistency. I'm not sure what the best format to share translations is -- we'll need to set something up in the spreadsheet. I can post the translations here if there's immediate interest |
If you need to find a synonym instead of being able to translate terms directly, the terms are, by definition, not directly translatable. That's fine of course, but unnecessary and inaccessible compared to terms that are directly translatable. Not to mention there isn't a reverse map for these other languages. But I see your point.
That would be grand, thank you. |
@Nytelife26 I think we misunderstand each other. I provided "blocklist" and "allowlist" to our translators and these are the terms they translated. These terms are directly translatable. Here are the translations for "blocklist". I'll post more once we figure out the best way to do it. Brazilian: lista de bloqueios |
Quite a few of those are not direct translations, which is what I was pointing out. For instance, the Spanish "lista de elementos bloqueados" is "list of blocked items" - a similar phrase, and one that would be better suited, but not a direct translation of blocklist. It is also admittedly an unusual construct, since Spanish is a very contextual language, but I suppose understandable. I obviously do not know every language you have provided, but many are not direct - and it makes more sense to use already existing terms with more clarity and the same meaning rather than proliferating the language and making new ones. |
Status on this, anyone? |
For a broader scope of discussion, I'll be making a similar pull request into Chromium's guidelines later, so we'll see how that goes. Thank you to everyone that has participated so far. |
Just curious but given that organizations like Intel, IBM and Google (I mention those as examples of organizations with international scope) have already indicated that differing contexts/communities likely require different replacements anyways, how big a priority do we think direct translation should be? |
Just curious but given that organizations like Intel, IBM and Google (I
mention those as examples of organizations with international scope)
have already indicated that differing contexts/communities likely
require different replacements anyways, how big a priority do we think
*direct translation* should be?
It may not be direct, but the fact is, those aren't translations at all.
Blocklist and the like are not translatable without first knowing what
they mean in their host language and then using the closest synonymous
representation that has a translation. Vaguely equivalent to me creating
a term, for instance, banentry, that does not work in its host language,
and then claiming it is fine because you can infer what it means and it
can be translated through "prohibition list entry" instead.
It doesn't work like that. Not to mention that, in instances where a
human translator is not available, machine translations (like google
translate itself) MUST be able to bridge the gap.
|
@Nytelife26 I think I tend to think of the distinction you are making in terms of translation vs transliteration. You can't transliterate an idiom (and I think your underlying point is to many terms are idiomatic). If you translate the component parts of an idiom you lose the meaning. So you are forced to increase the vocabulary of translations to include the idiomatic phrases themselves... which can be done, but is both much more work (especially across multiple languages and idioms) but also much more error prone. |
@Nytelife26 I *think* I tend to think of the distinction you are making
in terms of translation vs transliteration. You can't transliterate an
idiom (and I think your underlying point is to many terms are
idiomatic). If you translate the component parts of an idiom you lose
the meaning. So you are forced to increase the vocabulary of
translations to include the idiomatic phrases themselves... which *can*
be done, but is both much more work (especially across multiple
languages and idioms) but also much more error prone.
Precisely. It is more work, less accessible, more error prone, and
creates an unnecessary blockade for people - which is something we
should strive to avoid when there are better alternatives.
|
Are there any further discussions to be had about this? The verdict seems pretty clear so far, but I am open to further critique or questioning. |
@celestehorgan @justaugustus Has this been raised in the language workstream? |
We are just beginning our discussion of terms in the language workstream, so this discussion seems a little premature. |
We are just beginning our discussion of terms in the language
workstream, so this discussion seems a little premature.
Where does this take place? I would be interested in participating if
it's an open thing.
|
@Nytelife26 the language workstream is indeed open :) Their meetings and comms channels are listed here: https://inclusivenaming.org/workstreams/ :) |
Languages other than English may not allow turning a noun into a verb with the same agility that English has. Some of them might not form compound words easily and resort to English loan words. In such a scenario, it would not be impossible for the translator to revert "allowlist" into a loan word based on "blacklist", because that's a more commonly known word to his audience of non-English speakers. This would make the effort to use inclusive language vain. Therefore, emphasize more the possibility to use not just the "-ed" form of verbs, but also the verbs themselves without the "list" suffix. Their past participles easily work as adjectives, and they are easily translated. Also propose "valid" and "invalid" as replacements for whitelisted/blacklisted as an adjective. See also: inclusivenaming#45
Languages other than English may not allow turning a noun into a verb with the same agility that English has. Some of them might not form compound words easily and resort to English loan words. In such a scenario, it would not be impossible for the translator to revert "allowlist" into a loan word based on "blacklist", because that's a more commonly known word to his audience of non-English speakers. This would make the effort to use inclusive language vain. Therefore, emphasize more the possibility to use not just the "-ed" form of verbs, but also the verbs themselves without the "list" suffix. Their past participles easily work as adjectives, and they are easily translated. Also propose "valid" and "invalid" as replacements for whitelisted/blacklisted as an adjective. See also: inclusivenaming#45
I will be discussing this at the next INI meeting, as I am now a member of the language workstream. |
Now that we've started our process for writing recommendations, this should almost certainly come up in the list of terms to discuss. This should be resolved soon :) Thank you all, and there are some amazing people to work with here. I do not intend to make this my first and last contribution. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good alternatives came out of the discussion, ready to merge and move on.
See Google's v8 PR and the KCS open statement for more information.