Questions and Answers
Copyright © September 2009 by Tienzen (Jeh-Tween) Gong
Page 6
Since its inception on July 14, 2009, the PreBabel site has attracted many people's interest. An in-depth discussion of PreBabel took place at the "conlanger bulletin board," where many great questions and critiques were raised. The following is a brief summary of those discussions.
Page 1:
- Day one --- Summary of questions and critiques
- Day two --- Is a universal language possible?
- Day three --- What are the criteria for a universal language?
- Day four --- The history of finding the universal language root word set
- Day five --- The choices of roots for the universal language
- Day six --- Theoretical framework of a universal language
- Day seven --- Test procedure for validating a universal language
- Day eight -- The fuzzy logic and the PreBabel root word set
- Day nine --- Are all natural languages isomorphic among one another?
- Day ten --- PreBabel root word set is invented, not discovered
Page 2:
- Day eleven --- Private Language Thesis (PLT) and the types of language
- Day twelve --- Can any language be without verbs?
- Day thirteen --- The regression encoding procedure (REP) for PreBabel
- Day fourteen --- The attractor theorem and a universal language
- Day fifteen --- The innate meaning of a word token (of PreBabel) vs its semantic meaning
- Day sixteen --- Is English a universal language?
- Day seventeen --- A premise must be testable
- Day eighteen --- The method of handling any chaotic system, such as the system of natural languages
- Day nineteen --- Via PreBabel to learn any second language is to learn two instead of one, then, why do it?
- Day twenty --- A true Emperor cannot be discredited by any disbelieving person
Page 3:
- Day twenty-one --- Is Esperanto a universal language?
- Day twenty-two --- The strategy of constructing a universal language
- Day twenty-three -- Should PreBabel words be intuitive? And, the PreBabel a, b and c.
- Day twenty-four -- Can PreBabel (language x) be learned easier than the language x itself?
- Day twenty-five -- About "words and concepts of one language are grouped differently in another language."
- Day twenty-six -- The PreBabel process is as easy as 1, 2 and 3.
- Day twenty-seven -- How and when can PreBabel (Proper) emerge?
- Day twenty-eight -- more about intuitiveness.
- Day twenty-nine -- about memory anchors on learning a language.
- Day thirty -- about tests for PreBabel.
Page 4:
- Day thirty-one -- about PreBabel (Chinese).
- Day thirty-two -- the debut of PreBabel (Chinese) at AP Annual Conference 2007 (CollegeBoard).
- Day thirty-three -- traditional Chinese etymology vs PreBabel (Chinese).
- Day thirty-four -- the first constructed language, the Lii character set.
- Day thirty-five -- phonological reconstruction vs PreBabel (Chinese).
- Day thirty-six -- more about the construction of the Lii character set.
- Day thirty-seven -- Published works on PreBabel (Chinese).
- Day thirty-eight -- more of traditional Chinese etymology vs PreBabel (Chinese).
- Day thirty-nine -- PreBabel methodology I -- equivalent transformation.
- Day forty -- Types of conlang and more on traditional Chinese etymology vs PreBabel (Chinese).
Page 5:
- Day forty-one --- PreBabel epistemology: Occam's razor.
- Day forty-two --- axiomatic domain, theory and system
- Day forty-three --- about Sapir-Whorf hypothesis
- Day forty-four --- About the differences among languages
- Day forty-five --- Reasons being in the dark
- Day forty-six --- about large and complex system
- Day forty-seven --- A constructed linguistic universe (I)
- Day forty-eight -- about China's language policy
- Day forty-nine --- Constructed linguistic universe (II)
- Day fifty -- Constructed linguistic universe (III)
Page 6:
- Day fifty-one -- Constructed linguistic universe (IV)
- Day fifty-two -- Constructed linguistic universe (V)
- Day fifty-three -- Constructed linguistic universe (VI)
- Day fifty-four -- Constructed linguistic universe (VII)
- Day fifty-five -- Summary of constructed linguistic universe
- Day fifty-six -- Discovering the PreBabel principle
- Day fifty-seven -- Benefits of PreBabel
- Day fifty-eight -- the PreBabel procedures
- Day fifty-nine -- about Chinese Etymology
- Day sixty -- Can the parts be larger than the whole?
Page 7:
- Day sixty-one -- Sapir-Whorf Hypothesis revisited
- Day sixty-two -- The two step PreBabel procedures
- Day sixty-three -- Can linguistics be justified with math laws?
- Day sixty-four -- About heavily inflecting or agglutinating languages
- Day sixty-five -- Can any theory be based on only two highly atypical examples?
- Day sixty-six -- Can PreBabel encompass the Martian language?
- Day sixty-seven -- Can the word Şj be dissected and decoded with the PreBabel root set?
- Day sixty-eight -- Comparing the PreBabel (Chinese) with some old school ways
- Day sixty-nine -- Comparison (II)
- Day seventy -- Comparison (III)
Page 8:
- Day seventy-one -- Comparison (IV)
- Day seventy-two -- Comparison (V)
- Day seventy-three -- Sapir-Whorf Hypothesis again
- Day seventy-four -- the "center of gravity" for new linguistics
- Day seventy-five -- the reviews and the material facts on PreBabel (Chinese)
- Day seventy-six -- Is PreBabel just an oligosynthetic written Lojban?
- Day seventy-seven -- About the flexibility of language
- Day seventy-eight -- About the universal grammar
- Day seventy-nine -- The "Large Complex System Principle" (LCSP) & the Martian Language Thesis
- Day eighty -- The three tiers of axiomatic system hierarchy
Page 9:
- Day eighty-one -- Universal grammar -- the total freedom
- Day eighty-two -- Spider Web Principle and the Minimum Complexity Theorem
- Day eighty-three -- Life system is the Totality
- Day eighty-four -- SULT is a language continuum
Day fifty-one -- Constructed linguistic universe (IV)
From Tienzen:
Thus, we can rewrite the language "type" equation, Lx (a real natural language) = {Sa, Pa, Ia, Ra, Na, Ea} = {1, Pa, Ia, Ra, Na, 1}. Then,
Type 0 = {Pa, Ia, Ra, Na} = {0, 0, 0, 0}
Type 1 = {Pa, Ia, Ra, Na} = {1, 1, 1, 1}
... Corollary 1: English is a "type 1" language.
Then, we can compare the other real natural languages with this constructed language universe, one by one. Yet, I think two will be enough to prove the point, and I will make such a comparison with the Chinese language in my next post.
With the previous definitions:
Similarity transformation axiom -- Sa
Predicative axiom -- Pa
Inflection axiom -- Ia
Redundancy axiom -- Ra
Non-Communicative axiom -- Na
Exception axiom -- Ea
For Sa = 1, all other axioms either repeat or are inherited at each level or sub-level throughout the hierarchy. Thus, the language "type" equation can and should be written in more detail, such as,
Lx (a real natural language) = word {Pa, Ia, Ra, Na} + phrase {Pa, Ia, Ra, Na} + sentence {Pa, Ia, Ra, Na}
For Chinese language,
Pa = 0 for all levels.
Ia = 0 for all levels.
Ra = 0 for all levels.
Yet, Na (the Non-Communicative axiom) is not a (0, 1) operator but a fuzzy operator. And this fuzzy operator goes way beyond the coverage of Ea (the Exception axiom).
For Chinese words, Na basically equals zero (0), but its exceptions go way beyond what the Ea can cover. Thus, I must introduce a new concept, the "apostrophe": 0' is basically a 0, but with exceptions going way beyond what the Ea can cover. Note: this case exists at the pre-word level, which is not defined thus far.
For Chinese phrases, Na basically equals 1'; the word order of phrases does make a difference most of the time.
For Chinese sentences, Na basically equals 0'; the word order of sentences does "not" make a difference most of the time. For example, (I love he) = (love he I) = (he I love) = (love I he).
Thus, Lx (Chinese language) = word {Pa, Ia, Ra, Na} + phrase {Pa, Ia, Ra, Na} + sentence {Pa, Ia, Ra, Na}
= word {0, 0, 0, 0'} + phrase {0, 0, 0, 1'} + sentence {0, 0, 0, 0'}
With such a complicated equation, we should introduce an arithmetic table to calculate it. As there are three parts, we can define the operation table as below,
0 + 0 + 1 = 0'
1 + 1 + 0 = 1'
0 + 0 + 0' = 0'
1 + 1 + 1' = 1'
0 + 0 + 0 = 0
1 + 1 + 1 = 1
and, 0' + 1' + 0' = 0', so,
Lx (Chinese language) = {Pa, Ia, Ra, Na} = {0, 0, 0, 0'} = 0'
That is, the Chinese language is a (type 0') language.
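The operation table above can be read as a simple majority rule with an exception flag: the three base values (0 or 1) combine by majority, and the result carries an apostrophe whenever any of the three inputs deviates from a clean majority. A minimal Python sketch of that reading (the rule itself is my interpolation from the table rows, not stated explicitly in the text):

```python
# Each value is a (base, primed) pair: 0 -> (0, False), 0' -> (0, True), etc.
def combine(a, b, c):
    """Combine three per-level values per the operation table.

    Interpretation (an assumption): the result's base is the majority of
    the three bases; the result is primed (0' or 1') unless all three
    inputs are exactly the clean majority value.
    """
    triple = (a, b, c)
    bases = [v[0] for v in triple]
    majority = 1 if sum(bases) >= 2 else 0
    clean = all(v == (majority, False) for v in triple)
    return (majority, not clean)

def show(v):
    """Render a value in the document's notation, e.g. (0, True) -> 0'."""
    return f"{v[0]}{chr(39) if v[1] else ''}"

O, I = (0, False), (1, False)      # 0 and 1
Op, Ip = (0, True), (1, True)      # 0' and 1'

# Reproduce every row of the table:
assert combine(O, O, I) == Op      # 0 + 0 + 1  = 0'
assert combine(I, I, O) == Ip      # 1 + 1 + 0  = 1'
assert combine(O, O, Op) == Op     # 0 + 0 + 0' = 0'
assert combine(I, I, Ip) == Ip     # 1 + 1 + 1' = 1'
assert combine(O, O, O) == O       # 0 + 0 + 0  = 0
assert combine(I, I, I) == I       # 1 + 1 + 1  = 1
assert combine(Op, Ip, Op) == Op   # 0' + 1' + 0' = 0'  (the Chinese Na column)
```

Under this reading, the Na column for Chinese (word 0', phrase 1', sentence 0') evaluates to 0', matching the conclusion that Chinese is a (type 0') language.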
Now, we can revisit the English language. Superficially, English words are inflected at the "word form" level. Yet,
- Many words can represent many distinct parts of speech.
- The correct part of speech for many words cannot be decided without understanding the semantics or even the pragmatics of the context.
Thus, the Ia (inflection axiom) in English is not a perfect 1; it should be 1'. That is, the English language should be a (type 1') language. Perhaps, (type 0) and (type 1) are ideal languages.
Now, we know the difference between two languages. Is that difference superficial or fundamental? We need to introduce three more operators to answer this question.
Day fifty-two -- Constructed linguistic universe (V)
From Tienzen:
What we are doing here is not only new to linguistics but also new to science. Thus, we must make the terms we are using very clear, without any misunderstanding. The terms axiom, postulate, assumption, hypothesis and premise are sometimes viewed as synonyms. The following are the definitions for this work, the "constructed linguistic universe."
- Axiom -- a non-logical axiom, selected arbitrarily. Its purpose is to demarcate a domain.
- Hypothesis -- a statement which must be proved, generally via a theory.
- Postulate -- a statement that is assumed to be true without proof and serves as a starting point for proving other statements. In practice, a postulate must have enough evidence to support (not prove) its validity.
Now, I will introduce two postulates for this "constructed linguistic universe."
- Postulate one -- the "Operator of pidginning" transforms a language Lx toward the "type 0" language.
Definition 9 -- the "Operator of pidginning" transforms a language Lx to a pidgin (Lx).
- Postulate two -- the "Operator of creoling" transforms a pidgin (Lx) toward the "type 1" language.
Definition 10 -- the "Operator of creoling" transforms a pidgin (Lx) to a creole (Lx).
With these two postulates, we can make some predictions.
Prediction one -- Lx and Ly have different language structures. That is, [Lx - Ly] = D1,
and [pidgin (Lx) - pidgin (Ly)] = D2. Then,
D2 < D1; D2 is smaller than D1. That is, the difference in language structure, in terms of "language type," between two pidgins is smaller than the difference between the two original languages.
Prediction two -- Lx is a natural language with a creole (Lx), and Ly with a creole (Ly). And,
[Lx - creole (Lx)] = D1
[Ly - creole (Ly)] = D2
[creole (Lx) - creole (Ly)] = D3
Then, D3 < D1; D3 is smaller than D1, and
D3 < D2; D3 is smaller than D2.
That is, the difference in language structure, in terms of "language type," between two creoles is smaller than the difference between each creole and its parent language.
If we can find some evidence for these two predictions, the following hypothesis is proved.
Hypothesis one -- this "constructed linguistic universe" forms a linear language spectrum, ranging from the "type 0" to the "type 1". That is, all natural languages are distributed in this language spectrum, and this "constructed linguistic universe" encompasses the entire "real" linguistic universe.
If the "hypothesis one" is true, then the difference among natural languages is superficial, not fundamental. The great divide between the "type 0" and "type 1" can be bridged over with two operators, "Operator of pidginning" and "Operator of creoling".
Thus far, we have been concerned only with the "structures" of the languages, without any concern for the members (the word forms, sounds, cultures, etc.) in those structures. This, too, is an issue which must be discussed.
Day fifty-three -- Constructed linguistic universe (VI)
From Tienzen:
Thus far, we have made the following points.
A. The constructed language universe has three layers of hierarchy.
First, we should have a bird's-eye view of this constructed language universe. In fact, it has three layers (levels) of hierarchy.
- The pre-word layer (pw-sphere) -- this sphere is, in fact, not yet defined in this constructed language universe. Yet, it will be the vital sphere for PreBabel, and it will be added later.
- The word/sentence layer (ws - sphere) -- this sphere has three sub-layers
- the word sphere
- the phrase sphere
- the sentence sphere
This ws-sphere is governed (or delineated) by two operators, "Operator" of composite (Opc) and "Operator" of dot (Opd).
- The post-sentence layer (ps - sphere) -- this sphere is context and culture laden or centered. In fact, the Sapir-Whorf hypothesis is defined in this sphere, and thus it is a major interest of our discussion. This ps-sphere is governed by the "Operator" of accumulation (Opa).
Thus, each sphere is governed or delineated by operators. In this post, I will discuss only the ws-sphere. And, we can now "derive" some theorems and laws.
The pw-sphere is not yet defined. The ps-sphere is context (history, culture, etc.) centered, and thus, there should be some fundamental differences among different languages in this ps-sphere.
B. Thus far, our discussion is centered on ws-sphere, and I have reached the following points.
- There are different languages which have different language structures, ranging from "type 0" to "type 1".
- By introducing two operators, "Operator of pidginning" and "Operator of creoling", the great divide between the "type 0" and the "type 1" can be bridged over. That is,
- The "type 0" is the ground (or default) state.
- The "type 1" is the excited (or higher energy) state.
And, the transition between the two states can be achieved with two operators, "Operator of pidginning" and "Operator of creoling". Thus, a "hypothesis one" is suggested.
Hypothesis one -- this "constructed linguistic universe" forms a linear language spectrum, ranging from the "type 0" to the "type 1". That is, all natural languages are distributed in this language spectrum, and this "constructed linguistic universe" encompasses the entire "real" linguistic universe.
In order to prove that the "Hypothesis one" is true, we must construct a theory for it. And, I will start this with a definition.
Definition eleven (11) -- Lx and Ly are different sets (with different symbols and different numbers of symbols). Z is a Range Set. F is an (arbitrary) function.
if, F (Lx) = Z, (F maps Lx to Z)
and F(Ly) = Z, then
Lx and Ly are "functionally equal". And it is written as, Lx (=F=) Ly
With this definition on (=F=), functionally equal, we can make a new postulate.
Postulate three -- Lx and Ly are different natural languages in the ws-sphere, then
Lx (=F=) Ly
That is, the major known natural languages, at least the Big 6, are functionally equal in the ws-sphere.
Note: This "postulate three" does not cover other spheres, as the Lx and Ly might not be functionally equal in the ps-sphere which is history and culture centered.
The "postulate three" will play a vital role in our construction of this "constructed linguistic universe" and in our process of verifying the "hypothesis one."
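Definition 11 and postulate three can be made concrete in a sketch: treat each language's word set as a mapping onto a shared Range Set Z of language-neutral concept labels; two sets are functionally equal when functions map both onto the same Z. The two mini-lexicons below are hypothetical stand-ins of my own, not data from the text:

```python
# Hypothetical mini-lexicons: different symbols, different sizes,
# but both mappable onto the same Range Set Z of concept labels.
english = {"water": "WATER", "rain": "RAIN", "raindrop": "RAIN", "sky": "SKY"}
chinese = {"shui": "WATER", "yu": "RAIN", "tian": "SKY"}

def functionally_equal(fx, fy):
    """Lx (=F=) Ly per Definition 11: F(Lx) = Z and F(Ly) = Z,
    i.e. the two mappings have the same image (the Range Set Z)."""
    return set(fx.values()) == set(fy.values())

# Not equal as ordinary sets (different symbols, different sizes)...
assert set(english) != set(chinese)
# ...yet functionally equal: both map onto Z = {WATER, RAIN, SKY}.
assert functionally_equal(english, chinese)
```

Note that the transitive property asserted by postulate four holds automatically in this sketch, since equality of image sets is transitive.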
Day fifty-four -- Constructed linguistic universe (VII)
From Tienzen:
The concept of "functionally equal" is not new. But it is new as an operator in algebra and in set theory. Two sets A and B that are not equal in algebra or in traditional set theory can nevertheless be "functionally equal" under definition 11. Now, the internal dynamics of this "constructed linguistic universe" can be analyzed.
First, let's review some definitions.
- Definition two: Set Vx = {syx; syx is a symbol in Lx}.
- Definition three: Wx is a "word" in Lx if and only if the following two conditions are met.
- Wx is a syx of Lx.
- Wx has the following attributes:
- Wx has a unique topological form.
- Wx carries, at least, one unique completed sound note.
- Wx carries, at least, one unique meaning.
...
- Definition six: Sx is a "sentence" in Lx if and only if the following two conditions are met.
- Sx must have, at least, two wx. That is, Sx = Opc (syxa, syxb, ...).
- Sx must be an operand of Opd. That is, Sx = Opd (Opc (syxa, syxb, ...)).
Note: Definition 6.a -- If Sx has only one wx, Sx = Opd (wx) is a "degenerated" sentence.
- Definition seven: Px is a "phrase" in Lx if and only if the following two conditions are met.
- Px must have, at least, two wx. Px = Opc (syxa, syxb, ...)
- Px must "not" be an operand of Opd.
Second, the word/sentence layer (ws - sphere) -- this sphere has three sub-layers
- the word sphere
- the phrase sphere
- the sentence sphere
This ws-sphere is governed (or delineated) by two operators, "Operator" of composite (Opc) and "Operator" of dot (Opd).
With these definitions, the words, the phrases and the sentences are all members of the set Vx. And, the set Vx can be re-written as:
Set Vx = {syx; syx is a symbol in Lx, words, phrases, sentences}. Thus,
set Wx = {syx; syx is a word in Lx}
set Px = {syx; syx is a phrase in Lx}
set Sx = {syx; syx is a sentence in Lx}
And, set Vx = Wx U Px U Sx; (union of Wx, Px and Sx).
We now can prove some theorems.
Theorem two -- In ws-sphere (context free), Vx = Lx
Note: traditionally, we call Vx the syntax set.
Theorem three -- (Lx, Vx) and (Ly, Vy) are two different natural languages, then,
Vx (=F=) Vy
That is, the syntax sets of two natural languages are functionally equal.
Corollary 3.1 -- Lx and Ly are mutually translatable.
Postulate 4 -- the Transitive Property holds for the (=F=), the functional equal.
Now, we can re-write the set Vx.
Let P be a process, the combination of Opc (operator of composite) and Opd (operator of dot). Then,
P ({wx}) = Sx U Px = P (Wx); the process P generates Px (the phrases) and Sx (the sentences).
So, Vx = Wx U P(Wx), and I will re-write this set equation with a new convention,
Vx = (Wx, P); the Vx can be constructed by having Wx (the set of words) and P (the process of constructing phrases and sentences). This new convention is, in fact, an "equivalent transformation".
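The definitions reviewed above can be sketched in code: model Opc (operator of composite) as combining words and Opd (operator of dot) as marking a composite complete, so that, per definitions 6 and 7, a composite is a sentence if Opd has been applied and a phrase otherwise. The sample words are hypothetical:

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Composite:
    words: tuple      # built by Opc
    dotted: bool      # True once Opd (completion) has been applied

def Opc(*words):
    """Operator of composite: combine two or more words."""
    return Composite(tuple(words), dotted=False)

def Opd(composite):
    """Operator of dot (completion): mark a composite as a sentence."""
    return Composite(composite.words, dotted=True)

def kind(x):
    """Classify a member of Vx per definitions 6 and 7."""
    if isinstance(x, Composite):
        return "sentence" if x.dotted else "phrase"
    return "word"

# Hypothetical members of Wx:
p = Opc("big", "mountain")          # a phrase: Opc without Opd
s = Opd(Opc("I", "love", "you"))    # a sentence: Opd(Opc(...))
assert kind("mountain") == "word"
assert kind(p) == "phrase"
assert kind(s) == "sentence"
```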
Now, (Lx, Vx) and (Ly, Vy) are two different natural languages, and,
Vx = (Wx, Px) and Vy = (Wy, Py), where Px and Py are the respective processes
Per theorem 3 -- Vx (=F=) Vy, the syntax sets of two natural languages are functionally equal, and we can prove a new theorem,
Theorem 4 -- Wx (=F=) Wy and Px (=F=) Py; the word sets (and processes) of two natural languages are functionally equal.
Corollary 4.1 -- Wx (Chinese) (=F=) Wy (English).
Wx (Chinese) has only about 60,000 characters, and Wy (English) has about one million words. Yet, Wx (Chinese) is functionally equal to Wy (English).
Seemingly, corollary 4.1 is just a commonly known old fact. Yet, when it becomes a theorem, a new logic opens up. It says, in fact, that every English word can be encoded (or ciphered) with Chinese characters: one million words encoded with a few thousand characters.
If we can find a PB set such that PB (=F=) Wx (Chinese), then PB is functionally equal to the entire Chinese character set. With "postulate 4", the transitivity of (=F=):
Wx (Chinese) (=F=) Wy (English)
PB (=F=) Wx (Chinese)
then, PB (=F=) Wy (English)
That is, Wy (English), all English vocabulary, can also be encoded with PB.
Now, we have reached the starting point for PreBabel.
Day fifty-five -- Summary of constructed linguistic universe
[quote from sangi39] The formatting of this forum leaves the long list of "equations" provided across the last 2-3 pages disjointed to the point where some don't make all that much sense. [/quote]
Answer:
I will organize these posts into a single page during the holiday. Now, I am briefly summarizing them as below.
- The objective -- instead of analyzing the "real" linguistic universe, which is very complicated, with many chaotic data sets, I am simply constructing a "constructed linguistic universe" with some arbitrarily selected definitions, axioms, postulates, etc. Then, I will compare these two universes item by item. If I can show that the "constructed linguistic universe" does encompass the "real" linguistic universe, then a "Super Unified Linguistic Theory" is constructed.
- The constructed linguistic universe --
- Five definitions:
- Definition one -- the set UL, it encompasses "all" languages, Lx, Ly, ....
- Definition two -- the set Vx, it encompasses all symbols of "one" language, Lx.
- Definition three -- the words
- Definition four -- the phrases
- Definition five -- the sentences
These five definitions demarcate a linguistic universe.
- Three operators --
- Operator of composite
- Operator of dot (completion)
- Operator of accumulations
These three operators delineate a three layer (sphere) hierarchy.
- the Pre-word sphere
- the word/sentence sphere
- the post-sentence sphere
- Six axioms --
- Similarity transformation axiom -- Sa
- Predicative axiom -- Pa
- Inflection axiom -- Ia
- Redundancy axiom -- Ra
- Non-Communicative axiom -- Na
- Exception axiom -- Ea
These six axioms identify the language type, "type 0" and "type 1". Then, can this great divide between these two types be bridged over?
- Introducing the concept of "apostrophe," the type degeneration or deviation.
- Two more operators:
- Operator of pidginning
- Operator of creoling
Two postulates:
- Postulate one -- the "Operator of pidginning" transforms a language Lx toward the "type 0" language.
- Postulate two -- the "Operator of creoling" transforms a pidgin (Lx) toward the "type 1" language.
Two predictions:
- Prediction one -- the difference in language structure, in terms of "language type," between two pidgins is smaller than the difference between the two original languages.
- Prediction two -- the difference in language structure, in terms of "language type," between two creoles is smaller than the difference between each creole and its parent language.
- One more definition and two more postulates
Definition on functionally equal
Postulate three: the major known natural languages, at least the Big 6, are functionally equal in the ws-sphere.
Postulate four: the Transitive Property holds for the (=F=), the functional equal.
- Conclusion and comparison:
- Hypothesis one -- this "constructed linguistic universe" forms a linear language spectrum, ranging from the "type 0" to the "type 1". That is, all natural languages are distributed in this language spectrum, and this "constructed linguistic universe" encompasses the entire "real" linguistic universe.
- Theorems -- all theorems of this "constructed linguistic universe" are applied to the "real" linguistic universe, to see whether they hold or not.
- Theorem 1: English is a "type 1" language.
- Theorem three -- the syntax sets of two natural languages are functionally equal.
Corollary 3.1 -- Any two natural languages (Lx and Ly) are mutually translatable.
- Theorem 4 -- the word sets of two natural languages are functionally equal.
Corollary 4.1 -- Wx (Chinese) has only about 60,000 characters and Wy (English) has about one million words. Yet, Wx (Chinese) is functionally equal to Wy (English).
- Hypothesis two -- the PreBabel principle.
This outlines the entire framework for the "Super Unified Linguistic Theory."
Day fifty-six -- Discovering the PreBabel principle
From Tienzen:
Thus far, the Pre-Word sphere is not defined in this "constructed linguistic universe." There is also very little study of this pre-word sphere in the "real" linguistic universe. Phonology and morphology are subjects in the word/sentence sphere, although they might have some issues which fall in the pre-word sphere. Even etymology is not a 100% pre-word issue. Most of etymology discusses the evolution of words, instead of the structure of words.
Most vocabulary of natural languages is of an arbitrary type, which means that words are patterns of temporally ordered sound types, and the meaning of a word does not attach to particular activities, sounds, marks on paper, or anything else with a definite spatiotemporal locus. Some English words do arise from roots. Yet, those roots are called "root words"; that is, they are words, not pre-words. Furthermore, root words encompass only a very small portion of the English vocabulary. Again, the inflection of words is an issue in the word/sentence sphere, not a pre-word issue. For Chinese words, although the "Kangxi" leading radicals are known, the body of Chinese characters has, for thousands of years, remained a blob, an arbitrary vocabulary type.
After the publication of "Chinese Word Roots and Grammar" in 2006 and of "Chinese Etymology" in 2008, three new linguistic principles were discovered.
- There are three different vocabulary types.
- Type A -- chaotic data set; most members of the set stand alone, without any logical or genealogical connection to other members.
- Type B -- axiomatic data set, the "entire" (not partial) set can be derived from:
- a finite number of basic building blocks,
- a finite number of rules.
- Type C -- a hybrid data set, the mixing of type A and type B.
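The difference between a Type A and a Type B vocabulary can be illustrated with a sketch: a Type B (axiomatic) set is generated in its entirety from a finite set of building blocks by a finite set of rules, while a Type A (chaotic) set has no such derivation. The blocks, the pairing rule, and the nonsense Type A members below are hypothetical placeholders of my own:

```python
from itertools import product

# Type B: the entire set derives from finite blocks plus finite rules.
blocks = ["sun", "moon", "water", "fire"]
def rule_pair(a, b):
    return a + "-" + b                      # one hypothetical composition rule

type_b = set(blocks) | {rule_pair(a, b) for a, b in product(blocks, repeat=2)}

# Type A: a chaotic set; members stand alone, with no generative rule.
type_a = {"xylph", "grzt", "omber", "quell"}

def is_axiomatic(vocab, blocks, rules):
    """True if every member is a block or derivable by some rule."""
    derivable = set(blocks) | {r(a, b) for r in rules
                               for a, b in product(blocks, repeat=2)}
    return vocab <= derivable

assert is_axiomatic(type_b, blocks, [rule_pair])        # Type B: fully derivable
assert not is_axiomatic(type_a, blocks, [rule_pair])    # Type A: not derivable
```

A Type C set would simply be the union of a derivable part and a stand-alone part.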
There is an unsolved problem in linguistics, listed in Wikipedia.
[quote="Wikipedia"] What fundamental reasons explain why ultimate attainment in second language acquisition is typically some way short of the native speaker's ability, with learners varying widely in performance?[/quote]
With this new discovery, this unsolved problem is, in fact, removed. Please read the article "The New Paradigm of Linguistics" at:
http://www.chinese-word-roots.org/nparadi.htm
- With the discovery of the PreBabel principle,
If we can find a PB set such that PB (=F=) Wx (Chinese), then PB is functionally equal to the entire Chinese character set. With "postulate 4", the transitivity of (=F=):
Wx (Chinese) (=F=) Wy (English)
PB (=F=) Wx (Chinese)
then, PB (=F=) Wy (English)
That is, Wy (English), all English vocabulary, can also be encoded with PB.
Now, we have reached the starting point for PreBabel.
Thus, a "Law 1" is discovered.
Law 1: When encoded with a closed set of root words, any arbitrary-vocabulary-type language will be organized into a logically linked linear chain.
This is done with "Regressive encoding"; for example:
electricity (lightning, energy)
lightning (rain, energy)
rain (sky, water)
sky (above, mountain)
above (dot, horizontal bar)
Dot, horizontal bar, mountain and water are roots. This "Regressive encoding" process entails two steps:
- every word is linked to two (at most three) other words.
- the final destination is the closed root set.
Note 1: the logically linked linear chain acts as a chain, or a system, of logically linked mnemonics.
Note 2: a closed set means that the parts (radicals) of the entire vocabulary of a language will not contain any symbol beyond (or outside of) the given root word set.
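The regressive encoding example above can be checked mechanically. One caveat: the excerpt does not decompose "energy," so the sketch below treats it as a root purely to close the set; that is my assumption, not the text's.

```python
# The encoding table from the example above; "energy" is treated as a
# root here only to close the sketch (the text does not decompose it).
roots = {"dot", "horizontal bar", "mountain", "water", "energy"}
encoding = {
    "electricity": ("lightning", "energy"),
    "lightning": ("rain", "energy"),
    "rain": ("sky", "water"),
    "sky": ("above", "mountain"),
    "above": ("dot", "horizontal bar"),
}

def expand(word):
    """Regressively expand a word until only root symbols remain."""
    if word in roots:
        return [word]
    out = []
    for part in encoding[word]:          # each word links to 2-3 others
        out.extend(expand(part))
    return out

def is_closed(table):
    """Closed set: every part resolves into the root set, nothing outside it."""
    return all(p in roots or p in table
               for parts in table.values() for p in parts)

assert is_closed(encoding)
# "rain" bottoms out entirely in roots:
assert expand("rain") == ["dot", "horizontal bar", "mountain", "water"]
```

Each expansion step is exactly one link of the "logically linked linear chain" described in Note 1.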
- A "Law 2" is also discovered.
Law 2: When every natural language is encoded with a universal set of root words, a true Universal Language emerges.
These three new discoveries are the major issues in the Pre-Word sphere.
First, are these discoveries valid?
Second, what are the benefits that these new discoveries can provide?
These are the issues which must be answered.
Day fifty-seven -- Benefits of PreBabel
From Tienzen:
What are the benefits that these new discoveries can provide?
The PreBabelizing process provides two monumental benefits.
- It revolutionizes the way of language acquisition.
- It creates a true universal language.
Each and every natural language is just a set of data: the words (including the word forms, the word sounds and the word meanings), the phrases and the sentences. This set of data can be reduced to the set L = {words, a Process}, with the process creating the phrases and sentences. Thus, to learn a language is simply to "memorize" the set L.
Every memorization process (human or machine) consists of two steps.
- deposit the information
- recall the information
In order to recall the information, the information must be "indexed," and an index file is created. To maximize the memorizing process, memory is further divided into two steps.
- temporary (or short term) memory, such as the RAM
- Long term memory
While the computer memorization process can be done "almost" instantaneously, human long term memory requires a "burn-in" process which is limited by brain energy. That is, only a finite number of burn-ins per day can be done by a brain before it is exhausted. And, learning a language is simply managing the data set L with this memory energy.
For average persons (not geniuses), everyone's memory energy is about the same. Thus, we can prove a theorem.
Theorem: Lx and Ly are two data sets. Lx is a chaotic data set whose members are not related or linked to any other member. Ly is an organized data set whose members can be derived from a small set of roots. Let Mx be the memory energy required for Lx, and My the memory energy required for Ly. Then,
My < Mx
That is, My, the memory energy required for Ly, is much smaller than Mx.
In reality, human long term memory consists of two steps,
- anchoring -- burn-in the information and its indexing file
- webbing -- associating the new information with the anchored data, and this reduces the burn-in energy and the recalling efforts for the new information.
For learning the first language (the mother tongue),
- the verbal is learned with brutal anchoring efforts without any previously anchored base.
- the written is learned with the verbal as the anchored base.
For learning a second language -- both the verbal and the written must be learned with brutal anchoring efforts, without the help of any previously anchored base. Thus,
[quote="Wikipedia"] What fundamental reasons explain why ultimate attainment in second language acquisition is typically some way short of the native speaker's ability, with learners varying widely in performance?[/quote]
Now, we can analyze the great benefit of the PreBabel process for language acquisition. Let's use the Chinese language as the example.
- Chinese college graduates learn about 6,000 Chinese characters.
- Let memory energy on these 6,000 written words be 100
- Let memory energy on these 6,000 words on verbal (word sounds) be 100
That is, the total energy for learning these 6,000 words (written and verbal) is 200.
With PreBabel (Chinese),
- Only 220 roots (+50 variants) need to be memorized with brutal anchoring efforts. That is,
220 / 6000 = 0.037 = 3.7%
Yet, these 220 are much easier to learn than any of the 6,000.
- The 300 sound modules can be learned as derived words, and the effort is about 1/10 that of learning them the old school way.
(300 / 6000) x (1/10) = 0.005 = 0.5%
- The remaining 5,700 words are all derived from the above (220 + 300), and the effort is less than 1/100 (on average) that of learning them the old school way. Note: after a certain point (about 1,000 words learned), zero energy is needed.
(5700 / 6000) x (1/100) = 0.0095 = 0.95%
Thus, the total energy needed to learn 6,000 Chinese written characters with PreBabel (Chinese) is
0.037 + 0.005 + 0.0095 = 0.0515 = 5.15%
100 / 5.15 = 19.4
That is, the PreBabel (Chinese) is 19.4 times easier than the old school way.
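The arithmetic above can be checked directly; the text's figures of 5.15% and 19.4 come from first rounding 220/6000 up to 0.037, while the unrounded totals are about 5.12% and 19.5. A quick check:

```python
total_words = 6000

# Brutal anchoring of the 220 roots (the text's own 220/6000 figure):
root_cost = 220 / total_words                    # ~0.0367; the text rounds to 3.7%

# 300 sound modules learned as derived words at ~1/10 the old-school effort:
sound_cost = (300 / total_words) * (1 / 10)      # 0.005 = 0.5%

# Remaining 5,700 derived words at ~1/100 the old-school effort:
derived_cost = (5700 / total_words) * (1 / 100)  # 0.0095 = 0.95%

total_cost = root_cost + sound_cost + derived_cost   # ~0.0512 of old-school energy
speedup = 1 / total_cost                             # ~19.5x

assert abs(total_cost - 0.0512) < 0.0005
assert 19 < speedup < 20
```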
Yet, most importantly, the above process can be done without learning the verbal at the same time, which is almost impossible the old school way. After knowing the written, the verbal can be learned with the written as the "anchor," and it becomes much, much easier. This turns the language learning process completely upside down.
In summary, PreBabel improves language acquisition in two great ways:
- It reduces a huge data set to a very small root set, and thus reduces the memory energy by about 95%.
- It provides a memory anchor for learning the verbal when learning a second language.
Learning PreBabel (English) is quite similar to learning PreBabel (Chinese). I will discuss the minor differences between them soon.
Day fifty-eight -- the PreBabel procedures
[quote="Khagan"] How do words like the German "duzen" (to use the informal "du" [you] pronoun with someone) and "siezen" (to use the formal "Sie" [you] pronoun with someone) get translated to PreBabel English?
In other words, how does PreBabel's "universality" deal with words that only exist in one of the two languages being dealt with?
A possible (albeit admittedly not practical) Hungarian word based on a simple root and a myriad of prefixes and suffixes:
meg.szent.ség.telen.ít.het.et.len.ség.es.ked.és.e.i.tek.ért
szent - holy/saint
szentség - holiness
szentségtelen - unholy
szentségtelenít - [he] defiles/makes unholy
megszentségtelenít - [he] defiles/makes unholy (perfective)
...
megszentségteleníthetetlenség - the impossibility of being defiled/made unholy
...
Out of all those, only the last suffix "-ért" is unambiguously solely a grammatical one... arguably making the second last entry "meg.szent.ség.telen.ít.het.et.len.ség.es.ked.és.e.i.tek" the word one is left with after a "de-inflection process". [/quote]
Answer:
I know neither German nor Hungarian. Thus, I cannot answer your question in terms of those two languages. However, your question really raises three general questions.
- How to PreBabelize a word which is unique to a language?
- How to PreBabelize words which have unique relations in a language?
- How to PreBabelize words which are constructed within a unique cultural tradition (with a special myriad of prefixes and suffixes) in a language?
In fact, I have discussed these questions many times before. I will summarize them here again. The PreBabel process really has two steps.
- Encoding a given language, which in turn has three sub-steps.
  - Ciphering the vocabulary -- that is, every symbol in that language is ciphered. If "du" means [you], then "ev" = "du" also means [you], and "Sie" [you] = "Thf" [you]. If there are another million words for [you] in German, there are a million ciphers for [you] in German. There is not a single difference between the original German and the ciphered German in terms of its structure.
  - "Before" the ciphering, every word is encoded with two (at most 3) of its own words via a "regressive encoding process." In fact, this is a dictionary process. In a dictionary, a word is explained, in general, with a sentence or with a synonym. In this PreBabel process, a word is encoded with two words of the same language. That is, we are "making" every vocabulary word carry its own dictionary, nothing more and nothing less.
  - Only at the "final" stroke is a very small set of Generation 1 (the bottom base) words encoded with the PreBabel root set. This encoding might not be all that intuitive, such as (dot, stop) = "at". Then, all words are "progressively ciphered."
Note: as for the issue that "at" can perform hundreds of different kinds of acts, the (dot, stop) can do the same, as it is simply a cipher for "at". The internal meaning of (dot, stop) has nothing to do with its external performances. It is simply a mnemonic dictionary for the word "at."
These three sub-steps are done internally in a given language. And thus, all the unique linguistic and cultural features are completely (100%) preserved in its PreBabelized system.
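The regressive encoding step above can be sketched as a small data structure: most words map to two (at most three) words of the same language, and only Generation-1 words map directly to PB roots. Every entry below is a hypothetical illustration, not the actual PreBabel root set or any real PB (English) encoding.

```python
# Toy sketch of the regressive encoding process (REP) described above.
# All word entries and root names are hypothetical illustrations.

# A small set of Generation-1 words is encoded directly with PB roots.
GEN1 = {
    "at": ("dot", "stop"),     # the (dot, stop) = "at" example from the text
    "go": ("foot", "path"),    # hypothetical
}

# Every other word is encoded with two words of the same language, so each
# vocabulary entry "carries its own dictionary."
DERIVED = {
    "come": ("go", "at"),      # hypothetical
}

def expand(word, depth=0):
    """Regressively expand a word until only PB roots remain."""
    if word in GEN1:
        return list(GEN1[word])
    if word in DERIVED and depth < 10:   # depth guard against cycles
        parts = []
        for part in DERIVED[word]:
            parts.extend(expand(part, depth + 1))
        return parts
    return [word]                        # unknown tokens pass through

print(expand("come"))   # ['foot', 'path', 'dot', 'stop']
```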
Because every word carries its own dictionary, the PreBabelized system revolutionizes language acquisition.
- Emergence of the PreBabel (Proper), the true universal language -- after many languages are PreBabelized, they share the same PreBabel root set for their "word forms," and they form a big mixing pot. Every PreBabel (language x) becomes a dialect of this big mixing pot. Although each PreBabel (language x) is 100% linguistically and culturally centered on language x, the mixing pot can sort out the conflicts and remove the duplicates. Then, the PreBabel (Proper) will emerge. This process can begin after two PreBabel (language x) are done.
[quote="Khagan"]
meg.szent.ség.telen.ít.het.et.len.ség.es.ked.és.e.i.tek.ért
Who gets to undertake the "de-inflection process" for this word?[/quote]
Answer:
I am working on the PreBabel theory and two PreBabel (language x). The PreBabel (Chinese) is 100% done. The PreBabel (English) will be practically usable after 1,000 basic English words are PBlized. The PreBabel (proper) will emerge after two PreBabel (language x) are done.
I am unable to PBlize any other language myself. I think that someone will do them after the PreBabel is widely accepted.
De-inflection is not a necessary condition for PreBabel; a non-inflected word system can be inflected into a PB inflected phrase system. I will discuss this in the New Year.
Merry Christmas and Happy New Year to you all, and see you all in the New Year.
Day fifty-nine -- about Chinese Etymology
[quote from Khagan] Do you plan to create a version of your website that uses unicode for displaying the PreBabel/Chinese characters (are they all regular/existent Hanzi characters?) instead of using pictures?[/quote]
Answer:
Before the publication of my book, "Chinese Etymology," every Chinese character was viewed as a stand-alone blob with no internal structure beyond a "leading radical"; thus, 60% of the PB roots are not encoded in Unicode. Now, the Chinese government is planning to return to the traditional character set in 10 years to take advantage of its being a PreBabel (Chinese). I think those PB roots will have Unicode code points by then.
[quote from Khagan] Also, are there any short texts that have been PreBabelized for English? Or for other European languages? [/quote]
Answer:
Any text can be written once the vocabulary is encoded. PreBabel (English) has about three hundred words now. As soon as we encode about 1,000 PB (English) words, we can write some texts in PB (English) with ease.
For other European languages, we also need to encode some vocabulary first. We need a lot of friends to do this work. My priority, now, is to establish the theoretical framework of PreBabel. Linguistics departments at many universities around the world are now very interested in the PreBabel project. I will report the progress on this in due time.
At this point, the "root-form" of the PB root set is a bit cumbersome for encoding English. I will discuss this issue soon. For now, we can encode a language by using the R numbers, such as:
word a = R10 + R20 (roots composite)
or word b = G1 (20) + R30 (Regressive encoding, G1 is generation one word).
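A minimal sketch of how such an R-number lexicon might be stored as a simple word-to-codes mapping; the specific numbers and the words `word_a` and `word_b` are the placeholders from the examples above, not actual PB assignments.

```python
# Minimal sketch of the R-number encoding notation described above.
# "R10" denotes PB root number 10; "G1(20)" denotes generation-one word
# number 20. All numbers and example words are hypothetical placeholders.

lexicon = {
    "word_a": ["R10", "R20"],        # roots composite
    "word_b": ["G1(20)", "R30"],     # regressive encoding via a G1 word
}

def encode(word):
    """Return the R-number encoding of a word, or None if not yet encoded."""
    return lexicon.get(word)

print(encode("word_a"))   # ['R10', 'R20']
print(encode("word_b"))   # ['G1(20)', 'R30']
```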
[quote from Trailsend] Really? I heard back in March a proposal had been put before the NPC and CPPCC to return to the traditional system, but I didn't know it had been approved. [/quote]
Answer: The following is quoted from "The new Paradigm of Linguistics ( http://www.chinese-word-roots.org/nparadi.htm ),"
"The People's Republic of China (PRC) was founded in 1949. By then, China had suffered over 100 years of humiliation. The culprit for China's demise was identified to be the Chinese written language, which was viewed as a type A language without any logic to its complexity. In fact, it was viewed as a language without logic of any kind at all. A slogan of those days was "Without abandoning the Chinese word system, China as a nation would surely die." And the Chinese word system was also blamed as the sole reason for China's high illiteracy (over 85%) at that time.
However, the process of Romanization of Chinese words was not a success by 1958. The interim measure was to simplify. The simplification of the Chinese word system is, now, viewed as the greatest achievement of the PRC.
In 1958, if anyone in the world knew that the Chinese written language is a type B language (the easiest of all languages to learn), the above history would not have happened.
As the above history did happen, the Fact two is validated in and before 1958. Even today (March 7, 2009), one Chinese word expert in China emphasized that traditional (non-simplified) Chinese words are too difficult to learn for the young kids in China."
Three years ago, any question about the great achievement of the Simplified character set would have been viewed as a direct challenge to the PRC. Yet calls for returning to the traditional set have now been made two years in a row. Of course, it will take some maneuvering to make this change with honor, without disgracing the earlier act. The facts are the following:
- It takes 5 to 10 school years for a native Chinese kid to learn 3,000 to 5,000 Chinese words, while an American kid can do the same in six months by learning PreBabel (Chinese).
- About 95% of educated Chinese people are still semi-illiterate in terms of the Classic Chinese language, as they still do not truly know the meaning of each word when learning the old-school way. The verbal Chinese language uses a "phrase" as a word, while the Classic language uses a word as a word. Only by learning PreBabel (Chinese) will a student learn the meaning of each "word."
That is, by staying with the Simplified set, the PRC is wasting the youthful life of her people, which would be excusable if PreBabel (Chinese) were "not" known. One day, Chinese people may have to come to America to learn the Chinese written language.
Furthermore, a "universal language" is very important in world politics. With the development of PreBabel, PreBabel (English) has a chance to become the true "universal language." By then, the Chinese language would simply become a dialect of PreBabel (English).
[quote from Trailsend] Gosh darn it, that'll be troublesome...the Chinese program at my university is still teaching simplified.[/quote]
Answer:
Then, if you are learning Chinese at your university, you are wasting your time. If I were you, I would challenge the university program, as it is wasting every student's life. There is no argument about it at all, period. There are many kids smarter than you on this; visit ( http://www.chinese-word-roots.org/cw10.htm ).
[quote from sangi39] At least in terms of the UK high school education system a person learns the spoken and written language (speaking, listening, reading and writing) over 2 hours per week. In a given educational year a student will typically attend around 39/40 weeks of school and therefore 78-80 hours of language lessons per year totaling 234-240 hours of language lessons by the end of the pre-GCSE stage lessons at which point, assuming they were taught well and learnt well, they will be able to comprehend and use the spoken and written language to at least a basic degree.[/quote]
Answer:
Seemingly, you are talking about the "classroom" lessons, not including homework and self-study.
In the Preface of my book ( http://www.chinese-word-roots.org/pface01.htm ), it states,
"This is a self study book for someone who knows not a single Chinese written word, that is, the reader needs no tutor in order to study this textbook."
In the Introduction of the same book ( http://www.chinese-word-roots.org/intro10.htm ), it outlines the "self-study" schedule.
That is, my 200 hours are self-study hours with zero "classroom" hours. In reality, I do provide "classroom" teaching for kids, and it is about 60 "class hours" (two hours a week for 30 weeks). The bottom line is that it is the difference between 5 years and 200 hours.
[quote from sangi39] Although I can't read the Chinese replies, the English replies all seem to be what Trailsend termed "polite dismissals". They all seem to follow the same general pattern of "we'll pass it on" and "it was interesting" ... there is very little suggesting they'd go further than this.[/quote]
Answer:
There is "absolutely, absolutely, ..., absolutely" no need for those presidents of American universities to write back with a politely dismissive letter. A simple test is for you to send them a letter and see whether they reply with a polite letter or not.
Those are cop-out letters for passing the buck. The issue involves the conscience of the educators -- whether or not they are wasting their students' youthful lives away -- when this issue becomes history. By "passing it on," the burden on the conscience is also passed on. Obviously, they did realize that the issue at hand was very serious and severe.
[quote from Trailsend] I am not. (If I was, I wouldn't be quite so concerned) My understanding of the situation is limited to news stories I was able to find online, and what I was able to dig up on the English section of the Chinese government's webpage, including this article from the government's site, dated from August of this year, which didn't seem to suggest that there would be a movement back toward the traditional system.[/quote]
Answer:
Before the Iraq war, the US government launched a massive disinformation campaign. As the simplified set was identified as the GREATEST achievement of the PRC, any change must be weaseled in if she is to take a new direction, whatever it is. The key point is whether my claim in my "Chinese Etymology" is true or not. If it is not true, no further discussion is needed. It can simply be judged by anyone who knows not a single Chinese word. After reading the following page,
http://www.chineseetymology.com/exhibite.php
if one still agrees that the "cause" for launching the simplified system is valid, I will not try to convince him any further. The validity of my claim needs no Chinese government's approval. You yourself should have the ability to know the difference.
If my claim is true, then the Chinese government must face one dire FACT:
She is wasting her people's youthful life away.
[quote from Khagan] Tienzen, I am very interested in getting your book "Chinese Etymology". Is it really $400 for the 305 page paperback though? If so, why is it so expensive?[/quote]
Answer:
This is not a place for me to discuss the selling of books. But, I will answer your question about "why is it so expensive?"
It is a textbook, not a book for the general public. Its purpose is to teach a person to acquire the ability to read a Chinese newspaper with 200 hours of self-study. And the calculation of its worth is as follows.
It will take a person 5 school years to reach the same level by learning the old school way.
- It will cost a person $1,000 a year for going to an old school. Thus, 5 x $1,000 = $5,000
- It will save a person, at least, 4 years. This is very valuable.
- It will provide knowledge which can never be learnt anywhere else in the world. With the old school way, one person learns 3,000 Chinese words, and he knows those 3,000 words. Any new word will be an unknown word. With my Chinese Etymology, he learns 3,000 words, and he will know all (about 60,000) Chinese words. This is a value beyond any calculation.
Anyway, 400 / 5000 = 0.08; that is, 92% off.
[quote from Khagan] If your etymology is correct and factually based, it ought to be something even amateur linguists could partially recreate. And even if they do it imperfectly, my understanding is that well less than 10,000 characters are required for general literacy in Chinese.[/quote]
Answer:
This will be the case for all PreBabel (language x) but not for PreBabel (Chinese). The Chinese traditional set is heavily and deeply camouflaged. Without knowing those camouflages, no one knew that the Chinese traditional set has been a PreBabel for the past 2,000 years. Furthermore, the interaction between phonetics and Chinese characters is the most complicated among "all" natural languages. Without knowing the 300 sound modules, there is no chance to construct a useful "Chinese Etymology."
[quote from Khagan] ... the $400 list price is almost certainly a guarantee of perpetual obscurity for it ... [/quote]
Answer:
I charge a $3,000 tuition fee per student (for a 30-week lesson). Many parents are eagerly enrolling their kids for the lessons. I will give a presentation on PreBabel at Georgia Southern University ( http://class.georgiasouthern.edu/flseccll/index.html ) on Friday, April 2, 2010. I might give out 3 copies at a deep discount at that meeting.
[quote from Khagan] How familiar are you with any language that is neither English nor Chinese? PreBabel gives the impression of heavily inflecting/agglutinating languages not having been given much thought. [/quote]
Answer:
I learnt some Japanese and Spanish about 30 years ago. I am very fluent in Hunanese and Taiwanese. Yet, the point is not how many languages I know; it is about the language structures that I have learnt from those experiences.
- English/Chinese --- completely different "types" of languages
- Japanese/Chinese --- Japanese is not in Chinese family but is very, very heavily "influenced" by Chinese, and it goes way, way beyond the importing of a foreign language.
- Spanish/English --- two languages of a language family.
- Hunanese and Taiwanese --- dialects of a family
Thus, the issues of language types, the language-influencing process, the divergence process of a language family, and the dialect process are all covered by those experiences.
The PreBabelizing process (regressive encoding) is wholly independent of the type of vocabulary, regardless of whether it is inflected, agglutinated, or not. Regressive encoding encodes a word with its own words (two or three). Only at the last stroke is a small set of the first-generation words encoded with PB word roots. In fact, there are three ways to handle inflection in this PreBabelizing process.
- An inflected word can be separated into two parts, the body and the tail.
i-word = b-word (body) + tail
Only the b-word is PreBabelized, and the tail stays unchanged.
- A PB tail set is generated, and every i-word becomes a word phrase.
i-word = PB (b-word) + PB (tail), the i-word becomes a PB phrase.
- With a de-inflection process.
I will discuss these in detail in the future.
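The first two of these strategies can be sketched as follows; the suffix inventory and the `PB(...)` placeholder encoding are hypothetical illustrations, not actual PreBabel data.

```python
# Sketch of strategies 1 and 2 for PreBabelizing an inflected word:
#   1) split the word into body + tail and PreBabelize only the body;
#   2) PreBabelize both body and tail, turning the word into a PB phrase.
# The suffix list and the pb() mapping are toy examples.

SUFFIXES = ["ing", "ed", "s"]           # hypothetical tail inventory

def split_inflected(word):
    """Split an inflected word into (body, tail); tail may be empty."""
    for suf in SUFFIXES:
        if word.endswith(suf) and len(word) > len(suf):
            return word[: -len(suf)], suf
    return word, ""

def pb(token):
    """Placeholder PB encoding: a real system would look the token up
    in its PB root-composite lexicon; here we just wrap it."""
    return f"PB({token})"

def strategy1(word):
    """i-word = PB(b-word) + tail; only the body is PreBabelized."""
    body, tail = split_inflected(word)
    return pb(body) + tail

def strategy2(word):
    """i-word = PB(b-word) + PB(tail); the word becomes a PB phrase."""
    body, tail = split_inflected(word)
    parts = [pb(body)]
    if tail:
        parts.append(pb(tail))
    return " ".join(parts)

print(strategy1("walking"))   # PB(walk)ing
print(strategy2("walking"))   # PB(walk) PB(ing)
```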
Day sixty -- Can the parts be larger than the whole?
[quote] Can the parts be larger than the whole? [/quote]
Answer:
Before the inception of Prequark Dynamics, this made no sense in physics, as in the following sequence:
quark --> neutron or proton --> atom --> molecule --> large object
The sub-particle is always smaller than its parent particle in both size and mass. Yet, in Prequark Dynamics, although the quarks are composed of prequarks, the prequark is not smaller than its parent particle in either size or mass. This is why it is named prequark, not subquark.
Is there such a prequark phenomenon in the macro-world? In fact, most social phenomena are prequark-like. As social science does not understand the prequark phenomenon and treats all problems with the sub-particle concept, social science is not yet a true science, as it has used a wrong approach and a wrong principle to deal with its problems.
One simple prequark-like phenomenon is the "visible" iceberg which is composed of three parts.
- a big chunk of ice, about 10 times bigger than the visible iceberg.
- a large body of water (ocean or a large lake)
- a big space above it
Lacking any one of these three parts, the "visible iceberg" would not be a reality. Yet, each of its parts is much bigger than it in both size and mass.
In social science, the definition of "nation" is an unsolved problem. A "nation" can be roughly demarcated with the following:
- people
  - races
  - ethnic groups
  - languages
  - religions
  - history
  - etc.
- land
- etc.
Yet, every part of the "nation" (language, religion, etc.) can be much larger than the nation itself in both size and mass. This issue was discussed in detail in the article "Political Science and the Equation of War," which is available at
http://www.chinese-word-roots.org/cwr016.htm
In fact, the PreBabel is also a Prequark-like phenomenon. It is a "part" of all natural languages while it is much larger than them.