Draft Version 0.1
As i look at another etymology (church as it happens), I know I set myself rules but have so far not laid them out. So here is my first attempt:
- The most likely language is an early form of the current language.Whilst we do not know how languages developed, we know that much of the words in any language are "fellow passengers" on the same journey through time and space. Therefore, it follows, that what happened to one word is likely to have happened to much of the rest of the language. So, if words were related in the past, they will still tend to be related. Therefore the first place to study words are in the same language where they are found, because we expect other related words with similar phonetics and similar meanings. And once we understand the "family" of words and concepts, we have a sense of its place in the language.
- Indigenous words tend to be deeply rooted within the language
That is to say, there tend to be many closely related words, and these words in turn can be seen to be closely related to others and these in turn to others - until the meaning is so distant that these distant words do not appear to be related, yet a link of possible related words can be found. And whilst each individual relationship may be tenuous or uncertain, the wealth of such links is itself very strong proof of an ancient lineage.
- Intrusive words are not deeply rooted within the language
In contrast, words that have been borrowed from other languages - particularly very distantly related ones - or made up words or words whose meaning changes fundamentally due to a massive change in technology, will appear to be "isolates". It will be difficult if not impossible to find closely related linguistic words with close meanings.
- Overloaded linguistics
It appears to be a feature of language that whilst we expect close words to have close meanings, there appears to exist several different families of such closely related words in any area of "linguistic space". An example that comes to mind is "Drag" and "Dryg" (dry/drug) and "Dreog" (cause to happen) in Old English. Each of these have several closely related forms suggesting they are indigenous, but they are also "intrusive" in the sense they all share the same linguistic space and so are intrusive into each other's linguistic space. I have found that within Old English, there are typically 2-4 overlapping families of concepts in any linguistic space. This strongly suggests that these different families are in some sense "intrusive" but because of the size of their families, it suggests that their intrusion into "each other's" space occurred at some great age.
- Language flow is from deeply rooter → shallow
Words which are indigenous in a language are deeply rooted, and words which are intrusive are not. It therefore follows that if a word is to be said to "derive" from another language, then it must be shown that the word is more deeply rooted in that supposed "parent" language. If, in contrast, the relationship is the other way around (and Car is one I recall as being clearly intrusive into Latin), then rather than saying "barbaric XXX from Latin XXXium", one must say "barbaric XXX first recorded in its Latin form XXXium
- An word which is deeply rooted is indigenous unless or until its origin elsewhere is proven
One of the most annoying habits of those antiquarians who produced much of the etymology of words is that having assumed that English words must come from some "superior" language like Latin or Greek, or as a last resort, French or worse German, they would look at length in those languages and then if there were the merest hint of a word in these languages, they would categorically state that it was from those languages. That was absurd nonsense, putting the arse before the tit: asserting that words are "foreign" unless proven to be "native". In contrast, it is clear to me that words are native if they are "deeply rooted" and only foreign if they are more deeply rooted in a foreign language.
So, e.g. taking "drug". The word is present in Old English as "Druge" meaning dried. Yet the supposed etymology is from a much later text where there is a similar Dutch word meaning dried and the obvious Old English derivation from a closer word and closer language is completely ignored in favour of a more distant and therefore less likely origin. There might be some sense in this if the word "Drug" had not been derived from a word meaning "Dried" but instead one meaning "medicine". But when the same meaning and virtually the same word are present in both the supposed "origin" and the original language, at a time of poorly recorded texts in a context of "folk remedies" where such words might have been repressed, then it is absurd to jump to a foreign derivation unless it is necessary.
Specifically for early British linguistics:
- Old English is the most likely language of England: There is no evidence whatsoever that the language broadly in the area of modern English wasn't an early form of Old English.
(Which to distinguish it I suggest is called "British Germanic".
- Welsh-like languages are most likely in welsh-like language speaking areas: There is also no evidence that Welsh was not spoken in the areas on Nennius "left hand side of Britain". (Cornwall, Wales, Cumbria and SW Scotland)
- Celtic is a myth and the Gaelic cymbric "split" is much too old to use some hypothetical common language in any etymology. Contrary to what we are told, Gaelic is not closely related to Welsh. These two are far more dissimilar than other any other within a "family" of languages like e.g. Germanic. They may be closer than e.g. Greek and Latin, but as these are still two distinct language (groups) as far back we have written texts, it is very likely that Gaelic and Welsh were separate languages throughout all European recorded history.
- Indo-European should never be used in etymologies. Indo-European is a false concept of language development in the sense that it ignores the development of words from within languages and the frequent interchange of words between languages. It owes much of its concept to the same kinds of thoughts that led to the Nazi "Arian master-race". That in itself is not the problem. The problem is that all supposed "indo-European" words are formed by "dumbing down" to the lowest common denominator, which often means that a "word" is the only common letters of various languages and so little more than two very loose consonants with some kind of vowell between. So, e.g. a word can be considered to fit if it starts with t,d,th. has some kind of vowel in between and then ends with e.g. p, b, v, f. As there are only around a couple of dozen consonants, by the time you allow big groups of consonants to be "the same" like this the total number of possible variations in the whole languages drops to around 6 x 6 = 36 unique words in this supposed "language". In contrast, the Wikipedia article lists some 180 "Indo-European" words meaning that for most "derivations" there is choice of half a dozen words that "could be" the "origin".
- Words which derive from each other (tend to) get closer in form, the closer they are geographically and the closer in time.
In contrast to the Indo-European meme which says a word in Greek is equal to a word in German for deriving an English etymology, the common sense approach is to give priority to languages that are close and texts that are contemporary. So, e.g. if you were looking for the origin of "druid" in the Gaelic-Cymbric group, then as the evidence points to a mainland Britain origin, then if it were from this group, we would expect to find the closest words in Welsh and not Irish. And if we do not find the closer form is in the closer language then far from being "proof" of it being "Celtic", it is actually a strong indicator that the etymology is likely false.
That if words are related to each other in one language then because there is evidence that sounds change but it seems that the relationships appear to be stable, that we should find very similar relationships of words in very distantly related languages. So, e.g. if we pick the words "Bill" and "Ben". Then transform these with the following extraordinary perverse rules: B→S; i→o; e->aa; ll->K; n→q, we find that Bill → Sok and Ben goes to Saaq, but the two still share common linguistics as "sok and saaq".
So, words that are close to start, will tend to remain close even after massive linguistic changes that make them indistinguishable from their original form.