Super Unified Linguistic Theory
PreBabel --- The true Universal Language
Copyright © December 2009 by Tienzen (Jeh-Tween) Gong
Abstract:
The differences among natural languages are seemingly great. Yet, are these differences superficial or fundamental? Without a final theory on linguistics which encompasses all natural languages, this question cannot be answered. Yet, no such a final theory was in sight. However, there is a short-cut to this issue. Instead of constructing a final theory, we can construct a virtue linguistic universe. These two approaches are dramatically different. Theory is always "hypothesis" centered. The constructed universe can be built from the bottom up with some arbitrary definitions without any hypothesis. Then, this constructed universe must be checked with the "real" universe, item by item (its theorems, laws, phenomena, etc.). The theoretical basis on this was discussed in detail at the "PreBabel Dialogue,
http://www.prebabel.info/pqna005.htm#day42 ".
With a constructed linguistic universe, the Super Unified Linguistic Theory which encompasses all natural languages is emerged. Furthermore, the PreBabel principle which begets the true universal language is discovered.
Introduction and the table of content:
- Definitions -- five definitions and three operators
- Five definitions:
- Definition one -- the set UL, it encompasses "all" languages, Lx, Ly, ....
- Definition two -- the set Vx, it encompasses all symbols of "one" language, Lx.
- Definition three -- the words
- Definition four -- the phrases
- Definition five -- the sentences
These five definitions demarcate a linguistic universe.
- Three operators --
- Operator of composite
- Operator of dot (completion)
- Operator of accumulations
These three operators delineate a three layer (sphere) hierarchy.
- the Pre-word sphere
- the word/sentence sphere
- the post-sentence sphere
- Six axioms --
- Similarity transformation axiom -- Sa
- Predicative axiom -- Pa
- Inflection axiom -- Ia
- Redundancy axiom -- Ra
- Non-Communicative axiom -- Na
- Exception axiom -- Ea
These six axioms identify the language type, "type 0" and "type 1". Then, can this great divide between these two types be bridged over?
- The structure of a constructed linguistic universe
- Three layers of hierarchy
- the Pre-word sphere
- the word/sentence sphere (context free)
- the post-sentence sphere (context centered)
- Language types, ranging from "type 0" to "type 1"
- Comparison with the real linguistic universe
- Introducing the concept of "apostrophe," the type degeneration or deviation.
- "Type" algebra (type operation table)
- Comparing English and Chinese
- The language spectrum -- ranging from "type 0" to "type 1", and all languages are distributed in this language spectrum.
- Two more operators:
- Operator of pidginning
- Operator of creoling
- Two postulates:
- Postulate one -- the "Operator of pidginning" transforms a language Lx toward the direction to the "type 0" language.
- Postulate two -- the "Operator of creoling" transforms a pidgin (Lx) toward the direction to the "type 1" language.
- Two predications:
- Predication one -- the difference of the language structure in terms of "language type" between two pidgins is smaller than the difference between two original languages
- Predication two -- The difference of the language structure in terms of "language type" between two creoles is smaller than the difference between it and its parent language.
- Operator of (=F=), the functional equal
- Definition of (=F=), functionally equal
- Postulate three: the major known natural languages, at least the Big 6, are functionally equal in the ws-sphere.
- Postulate four: the Transitive Property holds for the (=F=), the functional equal.
- Linguistic theorems
- Hypothesis one -- this "constructed linguistic universe" forms a linear language spectrum, ranging from the "type 0" to the "type 1". That is, all natural languages are distributed in this language spectrum, and this "constructed linguistic universe" encompasses the entire "real" linguistic universe.
- Theorems -- all theorems of this "constructed linguistic universe" are applied on the "real" linguistic universe and to see whether they hold or not.
- Theorem 1: English is a "type 1" language.
- Theorem 2 -- the syntax sets of two natural languages are functionally equal.
Corollary 2.1 -- Any two natural languages (Lx and Ly) are mutually translatable.
- Theorem 3 -- the word sets of two natural languages are functionally equal.
Corollary 3.1 -- Wx (Chinese) has only about 60,000 characters and Wy (English) has about one million words. Yet, Wx (Chinese) is functionally equal to Wy (English).
- The discovery of the PreBabel principle.
- Law 1: Encoding with a closed set of root words, any arbitrary vocabulary type language will be organized into a logically linked linear chain.
- Law 2: When every natural language is encoded with a universal set of root words, a true Universal Language emerges.
- The PreBabel procedures -- the regressive encoding
- The Benefits of PreBabel
- It revolutionizes the way of language acquisition.
- It creates a true universal language.
- The Conclusion
I. Definitions
The followings are the definitions which demarcate the domain of a "constructed linguistic universe". Of course, this "constructed linguistic universe" will, then, be checked with the real linguistic universe, item by item.
- Definition one: Set UL = {Lx; Lx is a natural language}. So, the members of set UL are natural languages.
- Definition two: Set Vx = {syx; syx is a symbol in Lx}.
- Definition three: Wx is a "word" in Lx if and only if the following two conditions are met.
- Wx is a syx of Lx.
- Wx has the following attributes:
- Wx has a unique topological form.
- Wx carries, at least, one unique completed sound note.
- Wx carries, at least, one unique meaning.
Note: In a universe, some terms are known intuitively and are not defined. In general, these terms are known via some other disciplines. The following terms are undefined.
- Natural language
- Set, member and symbol
- topological form
- Completed sound note
- Meaning -- meaning is, in fact, a pointing function. When, F(wx) --> y, then, y is the meaning for wx.
- Definition four: "Operator" of composite (Opc) -- set Vx is the domain and the range for Opc. Then,
Opc (syx1, syx2, ...) = syxn
Note: there can be some laws for Opc, such as, the Commutative, Associative, Distributive Laws.
- Definition five: "Operator" of dot (Opd) -- Opd is placed at the utmost right position of a syx. Opc cannot have any operand which carries an Opd.
- Definition six: Sx is a "sentence" in Lx if and only if the following two conditions are met.
- Sx must have, at least, two wx. That is, Sx = Opc (syxa, syxb, ...).
- Sx must be an operand of Opd. That is, Sx = Opd (Opc (syxa, syxb, ...)).
Note: Definition 6.a -- If Sx has only one wx, Sx = Opd (wx) is a "degenerated" sentence.
- Definition seven: Px is a "phrase" in Lx if and only if the following two conditions are met.
- Px must have, at least, two wx. Px = Opc (syxa, syxb, ...)
- Px must "not" be an operand of Opd.
- Definition eight: "Operator" of accumulation (Opa) -- Only "sentences" of Lx can be the operands of Opa. Opa stacks "sentences" of Lx into a linear chain.
Seemingly these eight definitions are strange and simple. Can they truly demarcate a constructed linguistic universe? Can this constructed linguistic universe encompass the real linguistic universe?
II. Axioms
After the demarcation of a domain, we, now, can and need to construct the internal structure of this domain. That is, we need to introduce some axioms now. With different axioms, the internal structure of the domain will be different, or the different sub-domains will be constructed. I will, now, "introduce" (arbitrary chosen) six axioms for this "constructed linguistic universe." Similarly to the Parallel axioms in Geometry, every axiom can have more than one value.
- Similarity transformation axiom -- a rule (theorem or law) will repeat over and over in a domain or in different levels of its hierarchy. And, it has two values;
- Sa = 0, similarity transformation is not active.
- Sa = 1, similarity transformation is active.
- Predicative axiom -- particles in a glob (a word, a phrase or a sentence) is distinguishable. And, it has two values;
- Pa = 0, PA is not active.
- Pa = 1, PA is active.
When Pa = 1, a sentence "could" be first distinguished as the "Speaker" and the "others." If Sa = 1 also, then, the "others" can be further distinguished as,
- action (or state) words
- object (things or person) words
- pointing words, and these can be further distinguished as,
- pointing the action words
- pointing the object words
- gluing words
- others
- Inflection axiom -- some tags are tagged at the end of words. And, it has two values;
- Ia = 0, IA is not active
- Ia = 1, IA is active
- Redundancy axiom -- For a function F, it will be applied, at least, twice on its operand. And, it has two values;
- Ra = 0, RA is not active
- Ra =1, RA is active
Examples:
- Ra = 0;
I go to school "yesterday".
I have "three" dog.
I love He.
She love I.
- Ra = 1;
I "went" to school yesterday.
I have three "dogs".
I love him.
She loves me.
- Non-Communicative axiom -- for (a, b) and (b, a), they are "not" the same. And, it has two values;
- Na = 0, NA is not active
- Na = 1, NA is active
For a sentence,
- when Na = 0,
(I love you) = (love you I)
Note: If a Lx has Na = 0, it will run into some problems. Is (I love you) and (You love I) the same? Yet, there are some ways to resolve this kind of issue, and I will discuss it later.
- when Na = 1, then the "word order" is a rule.
- Exception axiom -- for every rule in the universe, there is one or some exceptions. And, it has two values;
- Ea = 0, EA is not active
- Ea = 1, EA is active
With these six axioms, a constructed language can be expressed as,
Lx (a constructed language) = {Sa, Pa, Ia, Ra, Na, Ea}
III. The structure of a constructed linguistic universe
Now, we have constructed two types of language, "type 0" and "type 1".
Type 0 = {0, 0. 0. 0. 0. 0}
Type 1 = {1, 1, 1, 1, 1, 1}
Our question is that whether there is any "real" natural language having a similar structure to these two types of constructed language or not. Perhaps, some real natural languages are hybrids of these two.
The real natural language universe is very complicated. Yet, the constructed language universe is quite simple thus far, with only 5 definitions, 3 operators and 6 axioms. Our final objective is to "derive" some languages which are similar with or identical to some natural languages. Yet, we should have a bird eyes view on this constructed language universe first. In fact, it has three layers (levels) of hierarchy.
- The pre-word layer (pw - sphere) -- this sphere is, in fact, not defined thus far in this constructed language universe. Yet, it will be the vital sphere for PreBabel. And, it will be added later.
- The word/sentence layer (ws - sphere) -- this sphere has three sub-layers
- the word sphere
- the phrase sphere
- the sentence sphere
This ws-sphere is governed (or delineated) by two operators, "Operator" of composite (Opc) and "Operator" of dot (Opd).
- The post-sentence layer (ps - sphere) -- this sphere is context and culture laden or centered. In fact, the Sapir-Whorf hypothesis is defined in this sphere, and thus it is a major interest of our discussion. This ps-sphere is governed by the "Operator" of accumulation (Opa).
Thus, each sphere is governed or delineated by operators. Now, I will discuss only the ws-sphere. And, we can "derive" some theorems and laws now.
IV. Comparison to the real linguistic universe
By comparing with the English, what is the type of language for English in terms of this "constructed language universe"?
- English is inflected --> Ia = 1
- English has a "subject -- predicate" structure --> Pa = 1
- English has parts of speech, tense, numbers, etc. --> Ra =1
- English has word order --> Na =1
For every real natural language, I think that it has Sa =1 and Ea =1. Thus, I will make this a law.
Law A: For every real natural language, it has Sa = 1 and Ea =1.
Thus, we can rewrite the language "type" equation, Lx (a real natural language) = {1, Pa, Ia, Ra, Na, 1}. Then,
Type 0 = {Pa, Ia, Ra, Na} = {0, 0, 0, 0}
Type 1 = {Pa, Ia, Ra, Na} = {1, 1, 1, 1}
Now, we should be able to prove a theorem:
Theorem 1: In comparing with the structure of English, a "type 1" language can encompass the English-like languages.
Corollary 1: English is a "type 1" language.
Then, we can compare the other real natural languages with this constructed language universe, one by one. Yet, I think that two will be enough to prove the point, and I will make such a comparison with Chinese language next.
For Sa = 1, all other axioms are either repeating or inherited in each level or sub-level through out the hierarchy. Thus, the language "type" equation can be and should be written in better details, such as,
Lx (a real natural language) = word {Pa, Ia, Ra, Na} + phrase {Pa, Ia, Ra, Na} + sentence {Pa, Ia, Ra, Na}
For Chinese language,
Pa = 0 for all levels.
Ia = 0 for all levels.
Ra = 0 for all levels.
Yet, for Na (the Non-Communicative axiom), it is not a (0, 1) operator but is a fuzzy operator. And this fuzzy operator goes way beyond the coverage of Ea (Exception axiom).
For Chinese words, the Na basically equals to zero (0), but its exceptions go way beyond the Ea can cover. Thus, I must introduce a new concept, the "apostrophe", 0' which is basically a 0 but with exceptions go way beyond the Ea can cover.
For Chinese phrases, the Na basically equal to 1'; the word order of phrases does make difference most of the time.
For Chinese sentences, the Na basically equals to 0'; the word order of sentences does "not" make difference most of the time. Such as, (I love he) = (love he I) = (he I love) = (love I he)
Thus, Lx (Chinese language) = word {Pa, Ia, Ra, Na} + phrase {Pa, Ia, Ra, Na} + sentence {Pa, Ia, Ra, Na}
= word {0, 0, 0, 0'} + phrase {0, 0, 0, 1'} + sentence {0, 0, 0, 0'}
With such a complicated equation, we should introduce an arithmetics table to calculate it. As there are three parts, we can define the operation table as below,
0 + 0 + 1 = 0'
1 + 1 + 0 = 1'
0 + 0 + 0' = 0'
1 + 1 + 1' = 1'
0 + 0 + 0 = 0
1 + 1 + 1 = 1
and, 0' + 1' + 0' = 0', so,
Lx (Chinese language) = {Pa, Ia, Ra, Na} = {0, 0, 0, 0'} = 0'
That is, the Chinese language is a (type 0') language.
Now, we can re-visit the English language. Superficially, the English words are inflected at the "word form" level. Yet,
- Many words can represent many distinct parts of speech.
- The correct part of speech for many words cannot be decided without understanding the semantics or even the pragmatics of the context.
Thus, the Ia (inflection axiom) in English is not a perfect 1, and it should be 1'. That is, the English language should be a (type 1') language. Perhaps, the (type 0) and (type 1) are ideal languages.
Now, we know the difference between two languages. Is that difference superficial or fundamental? We need to introduce two more operators to answer this question.
V. The language spectrum
What we are doing here is not only new to linguistics but is also new to science. Thus, we must make the terms that we are using very clear without any misunderstanding. The terms of axiom, postulate, assumption, hypothesis and premise are sometimes viewed as synonyms. The followings are the definitions for this work, the "constructed linguistic universe."
- Axiom -- it is a non-logical axiom and is selected arbitrary. Its purpose is to demarcate a domain.
- Hypothesis -- it is a statement which must be proved, generally via a theory.
- Postulate -- it is a statement that is assumed to be true without proof and to serve as a starting point for proving other statements. In practice, a postulate must have enough evidences to support (not to prove) its validity.
Now, I will introduce two postulates for this "constructed linguistic universe."
- Postulate one -- the "Operator of pidginning" transforms a language Lx toward the direction to the "type 0" language.
Definition 9 -- the "Operator of pidginning" transforms a language Lx to a pidgin (Lx).
- Postulate two -- the "Operator of creoling" transforms a pidgin (Lx) toward the direction to the "type 1" language.
Definition 10 -- the "Operator of creoling" transforms a pidgin (Lx) to a creole (Lx).
With these two postulates, we can make some predictions.
Predication one -- Lx and Ly have different language structures. That is, [Lx - Ly] = D1,
and [pidgin (Lx) - pidgin (Ly)] = D2, then,
D2 < D1, D2 is smaller than D1. That is, the difference of the language structure in terms of "language type" between two pidgins is smaller than the difference between two original languages
Predication two -- Lx is a natural language with a creole (Lx) and Ly with creole (Ly). And,
[Lx - creole (Lx)] = D1
[Ly - creole (Ly)] = D2
[creole (Lx) - creole (Ly)] = D3
Then, D3 < D1, D3 is smaller than D1, and
D3 < D2, D3 is smaller than D2.
The difference of the language structure in terms of "language type" between two creoles is smaller than the difference between it and its parent language.
If we can find some evidences for these two predictions, the following hypothesis is proved.
Hypothesis one -- this "constructed linguistic universe" forms a linear language spectrum, ranging from the "type 0" to the "type 1". That is, all natural languages are distributed in this language spectrum, and this "constructed linguistic universe" encompasses the entire "real" linguistic universe.
If the "hypothesis one" is true, then the difference among natural languages is superficial, not fundamental. The great divide between the "type 0" and "type 1" can be bridged over with two operators, "Operator of pidginning" and "Operator of creoling".
VI. Operator of (=F=), the functional equal
Thus far, we have made the following points.
A. The constructed language universe has three layers of hierarchy.
- The pre-word layer (pw - sphere)
- The word/sentence layer (ws - sphere) -- this sphere has three sub-layers
- the word sphere
- the phrase sphere
- the sentence sphere
This ws-sphere is governed (or delineated) by two operators, "Operator" of composite (Opc) and "Operator" of dot (Opd).
- The post-sentence layer (ps - sphere) -- this sphere is context and culture laden or centered. This ps-sphere is governed by the "Operator" of accumulation (Opa).
B. Thus far, our discussion is centered on ws-sphere, and I have reached the following points.
- There are different languages which have different language structures, ranging from "type 0" to "type 1".
- By introducing two operators, "Operator of pidginning" and "Operator of creoling", the great divide between the "type 0" and the "type 1" can be bridged over. That is,
- The "type 0" is the ground (or default) state.
- The "type 1" is the excited (or higher energy) state.
In order to prove that the "Hypothesis one" is true, we must construct a theory for it. And, I will start this with a definition.
Definition eleven (11) -- Lx and Ly are different sets (with different symbols and different numbers of symbols). Z is a Range Set. F is an (arbitrary) function.
if, F (Lx) = Z, (F maps Lx to Z)
and F(Ly) = Z, then
Lx and Ly are "functionally equal". And it is written as, Lx (=F=) Ly
With this definition on (=F=), functionally equal, we can make a new postulate.
Postulate three -- Lx and Ly are different natural languages in the ws-sphere, then
Lx (=F=) Ly
That is, the major known natural languages, at least the Big 6, are functionally equal in the ws-sphere.
Note: This "postulate three" does not cover other spheres, as the Lx and Ly might not be functionally equal in the ps-sphere which is history and culture centered.
VII. Linguistic theorems
The concept of "functional equal" is not new. But it is new as an operator in algebra and in set theory. For two sets, A and B which are not equal in algebra nor in traditional set theory but can be "functionally equal" with definition 11. Now, the internal dynamics of this "constructed linguistic universe" can be analyzed.
As the ws-sphere is governed (or delineated) by two operators, "Operator" of composite (Opc) and "Operator" of dot (Opd) and as the words, the phrases and the sentences are all members of the set Vx, the set Vx can be re-written as:
Set Vx = {syx; syx is a symbol in Lx, words, phrases, sentences}. Thus,
set Wx = {syx; syx is a word in Lx}
set Px = {syx; syx is a phrase in Lx}
set Sx = {syx; syx is a sentence in Lx}
And, set Vx = Wx U Px U Sx; (union of Wx, Px and Sx).
We now can prove some theorems.
Theorem two -- in the ws-sphere, (Lx, Vx) and (Ly, Vy) are two different natural languages, then,
Vx (=F=) Vy
That is, the syntax sets of two natural languages are functionally equal.
Corollary 2.1 -- Lx and Ly are mutually translatable.
Postulate 4 -- the Transitive Property holds for the (=F=), the functional equal.
Now, we can re-write the set Vx.
Let P is a process, the combination of Opc (operator of composite) and Opd (operator of dot).
As the process P generates the Px (phrases) and Sx (sentences), then,
P ({wx}) = Sx U Px = P (Wx)
So, Vx = Wx U P(Wx) , and I will re-write this set equation with a new convention,
Vx = (Wx, P), the Vx can be constructed by having Wx (set of words) and P (process of constructing phrases and sentences). This new convention is, in fact, an "equivalent transformation".
Now, (Lx, Vx) and (Ly, Vy) are two different natural languages, and,
Vx = (Wx, Px) and Vy = (Wy, Py)
Per theorem 2 -- Vx (=F=) Vy, the syntax sets of two natural languages are functionally equal, and we can prove a new theorem,
Theorem 3 -- (Lx, Wx) and (Ly, Wy) are two different natural languages, then,
Wx (=F=) Wy and Px (=F=) Py,
the word sets of two natural languages are functionally equal.
Corollary 3.1 -- Wx (Chinese) (=F=) Wy (English).
Wx (Chinese) has only about 60,000 characters, and Wy (English) has about one million words. Yet, Wx (Chinese) is functionally equal to Wy (English).
Seemingly, this corollary 3.1 is a commonly known old fact. Yet, when it becomes a theorem, a new logic is opened up. It, in fact, says that every English word can be encoded (or ciphered) with Chinese characters, one million words being encoded with a few thousand characters.
If we can find a PB set, and PB (=F=) Wx (Chinese); PB is functionally equal to the entire Chinese character set. With the "postulate 4", the transitive of (=F=), PB (=F=) Wy (English). That is, all English vocabulary can also be encoded with PB.
VIII. Discovering the PreBabel Principle
Thus far, the Pre-Word sphere is not defined in this "constructed linguistic universe." There is also very little study on this pre-word sphere in the "real" linguistic universe. The phonology and the morphology are subjects in the word/sentence sphere, although they might have some issues which fall in the pre-word sphere. Even the etymology is not an 100% pre-word issue. Most of the etymology discusses the evolution of the words, instead of the structure of words.
Most of vocabulary of natural languages are a type of arbitrary vocabulary which means that words are patterns of temporally ordered sound types, and meaning of a word does not attach to particular activities, sound, marks on paper, or anything else with a definite spatiotemporal locus. Some English words do arise from roots. Yet, those roots are called "root words," that is, they are words, not pre-words. Furthermore, root words encompass only a very small portion of the English vocabulary. Again, the inflection of words is the issue in the word/sentence sphere, not a pre-word issue. For Chinese words, although the "Kangsi" leading radicals are known, the body of Chinese characters, for thousands years, remains a blob, an arbitrary vocabulary type.
After the publication of "Chinese Word Roots and Grammar" in 2006 and of "Chinese Etymology" in 2008, two new linguistic principles were discovered.
- There are three different vocabulary types.
- Type A -- chaotic data set, most of the member of the set are stand alone without any logic or genealogical connection with other members.
- Type B -- axiomatic data set, the "entire" (not partial) set can be derived from:
- a finite number of basic building blocks,
- a finite number of rules.
- Type C -- a hybrid data set, the mixing of type A and type B.
There is an unsolved problems in linguistics, listed in Wikipedia.
[quote="Wikipedia"] What fundamental reasons explain why ultimate attainment in second language acquisition is typically some way short of the native speaker's ability, with learners varying widely in performance?[/quote]
With this new discovery, this unsolved problem is, in fact, removed. Please read the article "The New Paradigm of Linguistics," at;
http://www.chinese-word-roots.org/nparadi.htm
- The discovery of the PreBabel principle,
If we can find a PB set, and PB (=F=) Wx (Chinese); PB is functionally equal to the entire Chinese character set, with the "postulate 4", the transitive of (=F=),
Wx (Chinese) (=F=) Wy (English)
PB (=F=) Wx (Chinese)
then, PB (=F=) Wy (English)
That is, Wy (English), all English vocabulary, can also be encoded with PB.
Then, two laws are discovered.
- Law 1: Encoding with a closed set of root words, any arbitrary vocabulary type language will be organized into a logically linked linear chain.
- Law 2: When every natural language is encoded with a universal set of root words, a true Universal Language emerges.
These new discoveries are the major issues in the Pre-Word sphere. Please visit http://www.prebabel.info
IX. The PreBabel Procedures
Yet, are these discoveries valid? What are the benefits that these new discoveries can provide?
- How to PreBabelize a word which is unique to a language?
- How to PreBabelize words which have unique relations in a language?
- How to PreBabelize words which are constructed with a unique culture tradition (with special myriad prefixes and suffices) in a language?
The PreBabel process really has two steps.
- Encoding a giving language, and it again has three sub-steps.
- Ciphering the vocabulary -- that is, every symbol in that language is ciphered. if "du" means [you] in German, then "ev" = "du" also means [you]. If there are another million [you] in German, there are a million ciphers for [you] in German. There is not a single difference between the original German and the ciphered German in terms of its structure.
- "Before" the ciphering, every word is encoded with two (maximally 3) of its own words with a "regressive encoding process". In fact, this is a dictionary process. In dictionary, a word is explained, in general, with a sentence or with a synonym. In this PreBabel process, a word is encoded with two words of the same language. That is, we are "making" every vocabulary carries its own dictionary. The following is one example of this "regressive encoding,"
electricity (lightning, energy)
lightning (rain, energy)
rain (sky, water)
sky (above, mountain)
above (dot, horizontal bar)
dot, horizontal bar, mountain and water are roots.
- Only at the "final" stroke, a very small set of the Generation 1 (the bottom base) words are encoded with the PreBabel root set. This encoding might not be all that intuitive, such as, the (dot, stop) = "at". Then, all words are "progressively ciphered."
Note: the issue that "at" can perform hundreds different kinds of acts, the (dot, stop) can do the same as it is simply a cipher for "at". The internal meaning of (dot, stop) has nothing to do with its external performances. It is simply a mnemonic dictionary for the word "at."
These three sub-steps are done internally in a given language. And thus, all the unique linguistic and cultural features are completely (100%) preserved in its PreBabelized system.
Because that every word carries its own dictionary, the PreBabelized system revolutionizes the way of language acquisition.
- Emerging the PreBabel (Proper), the true universal language -- after many languages are PreBabelized, they are sharing the same PreBabel root set for their "word forms." And, they form a big mixing pot. Every PreBabel (language x) becomes a dialect of this big mixing pot. Although the PreBabel (language x) is 100% linguistic and cultural centered in the language x, the mixing pot can sort out the conflicts and remove the duplicates. Then, the PreBabel (proper) will emerge. This process can begin after two PreBabel (language x) are done.
X. The Benefits of PreBabel
What are the benefits that these new discoveries can provide? The PreBabelizing process provides two monumental benefits.
- It revolutionizes the way of language acquisition.
- It creates a true universal language.
Each and every natural language is just a set of data, the words (including the word forms, the word sounds and the word meanings), the phrases and the sentences. This set of data can be reduced to set L = {words, a Process}, with the process to create phrases and sentences. Thus, to learn a language is simply to "memorize" the set L.
Every memorization process (human or machine) consists of two steps.
- deposit the information
- recall the information
In order to recall the information, the information must be "indexed" with a index file. For maximizing the memorizing process, it is further divided into two steps.
- temporary (or short term) memory, such as the RAM
- Long term memory
While the computer memorization process can be done "almost" instantaneously, the human long term memory requires a "burn-in" process which is limited with the brain energy. That is, only a finite numbers of burn-in per day can be done by a brain before it is exhausted. And, learning a language is simply managing the data set L with the memory energy.
For average persons (not genius), everyone's memory energy is about the same. Thus, we can prove a theorem.
Theorem 4: Lx and Ly are two data sets. Lx is a chaotic data set with members which are not related or linked to any other member. Ly is an organized data set with members which can be derived from a small set of roots. And, Mx is the memory energy required for Lx; My is the memory energy required for Ly. Then,
My < Mx
The memory energy required for My is much smaller than for the Mx.
In reality, human long term memory consists of two steps,
- anchoring -- burn-in the information and its indexing file
- webbing -- associating the new information with the anchored data, and this reduces the burn-in energy and the recalling efforts for the new information.
For learning the first language (the mother tongue),
- the verbal is learned with brutal anchoring efforts without any previously anchored base.
- the written is learned with the verbal as the anchored base.
For learning the second language -- both verbal and written must be learned with brutal anchoring efforts without the help of any previously anchored base. Thus, the ultimate attainment in second language acquisition is typically some way short of the native speaker's ability.
Now, we can analyze the great benefit of PreBabel process on language acquisition. Let's use Chinese language as the example.
- Chinese college graduates learn about 6,000 Chinese characters.
- Let memory energy on these 6,000 written words be 100
- Let memory energy on these 6,000 words on verbal (word sounds) be 100
That is, the total energy for learning these 6,000 words (written and verbal) is 200.
With PreBabel (Chinese),
- Only 220 roots (+50 variants) need to be memorized with the brutal anchoring efforts. That is,
220 / 6000 = 0.037 = 3.7%
Yet, these 220 are much easier than any of the 6,000.
- The 300 sound modules can be learned as derived words, and the effort is about 1/10 of by learning with the old school way.
(300 / 6000) x (1/10) = 0.005 = 0.5%
- The remaining 5700 words are all derived words from the above (220 + 300), and the effort is less than 1/100 (in average) of by learning with the old school way. Note: after one point (about 1,000 words learned), zero energy is needed.
(5700 / 6000) x (1/100) = 0.0095 = 0.95%
Thus, the total energy needs to learn 6,000 Chinese written characters with Prebabel (Chinese) is
0.037 + 0.005 + 0.0095 = 0.0515 = 5.15%
100 / 5.15 = 19.4
That is, the PreBabel (Chinese) is 19.4 times easier than the old school way.
Yet, most importantly, the above process can be done without learning the verbal at the same time which is almost impossible for the old school way. After knowing the written, the verbal can be learned with the written as the "anchor" and becomes much, much easier. This turns the language learning process upside down completely.
In summary, the PreBabel improves the language acquisition in two great ways,
- Reduce a huge data set to a very small root set, and thus reduce the memory energy about 95%.
- Provide a memory anchor for learning the verbal in learning the second language.
Learning PreBabel (English) is quite similar to learning PreBabel (Chinese). Please visit http://www.prebabel.info/bab015.htm
XI. The Conclusion
After the successful of applying the derived theorems and laws on the "real" language universe, the "Constructed Linguistic Universe" is, now, the "Super Unified Linguistic Theory." It forms a language spectrum, and all natural languages are distributed in this linguistic spectrum. Although every natural language Lx has its own Wx (word set), Px (grammar rules) and sits at its own position in the language spectrum, it is, in fact, functionally equal to all other languages. Then, the PreBabel principle and procedures were discovered. And, the PreBabel (Proper), the true universal language, emerges. Finally, it revolutionizes the way of language acquisition.
Note: This paper will be presented on Friday, April 2, 2010 at "8th. Annual SECCLL Conference", Southeast Coastal Conference on Languages and Literatures (SECCLL), Georgia Southern University.
Please visit,
http://class.georgiasouthern.edu/flseccll/index.html