Illandancient
Wikipedia:Babel | ||||||
---|---|---|---|---|---|---|
| ||||||
Sairch uiser leids |
Hi, name's Chris. I was brought up in England with a Scots speaking father, and subsequently spent ten years in Glasgow 'involved' with the local indie music scene, but now I'm settled in Hertfordshire.
In terms of being a Scots speaker, I'm a fraud.
But I can understand Scots and I have read a couple of children's books like The Gruffalo's Bairn and Asterix.
However, I do know about word frequency lists and how to faff about with data, which might be useful for comparing the Scots wikipedia lexis with other regional lexii.
I feel its a heavy burden to try to get the Scots wikipedia right. With 1.9 million native speakers, it seems odd that many of most enthusiastic wiki editors are non-Scots. There are many spellings that seem wrong. I've tried to whittle them out with data and word frequency lists, but identifying them isn't enough, the occurrences of dodgy spellings keep growing as people reach for the various online dictionaries, and say "there! the word exists, last used in 1839 but a legitimate Scots word!", despite no one ever using it in Modern Scots. Or maybe they do. I don't know. I'm not in Scotland standing next to the radge on the street, listening to their intonations as they describe global geo-politics, operating systems and such like.
- List of Pages with Scots references
- Discussion and resources about word frequencies on different language Wikis
- Comparison of word frequencies in different corpi
- Late Sep Lexis A bulletin based on word frequency comparisons
- Uiser:Illandancient/21t Scots Word Frequency Taken from a Corpus of 21st Century Scots writing
- Uiser:Illandancient/Live sources of Scots text
Wikipedia words I feel strongly about
eedit- seestem - nonsense, also solar seestem
- diskivert - what wrong with 'fund' or 'discovert'?
- aurie - its not modern scots, but its infiltrated the whole of the scots wiki
- ceety - what's wrong with city?
- beeshop - its nothing to do with religion, its shop that sells bees
Top five hunner wirds on the Sco.wiki
eeditHere's a leet of the top five hunner maist uised wirds on Scots Wikipedia with the nummer o occurrences in brackets.
the (409313) | ower (4456) | aroond (2542) | seestem (1783) | includes (1431) | steid (1172) |
an (180891) | mair (4420) | while (2529) | keeng (1777) | awtho (1422) | dounset (1172) |
in (156408) | some (4394) | touns (2525) | maria (1770) | castle (1421) | queen (1171) |
is (109320) | wast (4313) | well (2521) | third (1762) | poleetical (1416) | sangster (1165) |
tae (73354) | unitit (4264) | kinrick (2470) | general (1760) | black (1400) | til (1164) |
wis (58680) | syne (4237) | sic (2464) | end (1758) | player (1400) | considered (1164) |
as (45754) | history (4206) | afore (2453) | mairiage (1755) | wife (1399) | ireland (1153) |
it (33334) | admeenistrative (4191) | baund (2435) | union (1737) | popular (1394) | commonly (1150) |
bi (32150) | roushie (4168) | nummer (2431) | acause (1737) | territory (1394) | sootheast (1148) |
wi (30629) | san (4085) | kirk (2414) | fitbaw (1737) | relations (1388) | oreeginal (1145) |
on (30060) | time (4059) | faimily (2408) | greek (1735) | open (1386) | gien (1144) |
for (29667) | cried (4022) | auld (2404) | period (1730) | back (1375) | americae (1143) |
frae (25821) | naitional (3997) | soothren (2400) | company (1728) | public (1364) | single (1142) |
that (22820) | three (3921) | nor (2392) | politeecian (1716) | commonty (1363) | varsity (1139) |
at (18075) | lairgest (3918) | member (2386) | league (1712) | career (1359) | wirk (1137) |
or (17805) | duke (3906) | foondit (2378) | thir (1708) | caur (1357) | anly (1136) |
ceety (17682) | veelage (3847) | german (2372) | empire (1707) | railwey (1353) | members (1131) |
are (17228) | umwhile (3827) | then (2371) | eastren (1707) | meanin (1352) | federal (1131) |
he (17067) | no (3823) | they (2365) | covers (1704) | twinned (1350) | robert (1128) |
haes (16191) | american (3809) | important (2359) | mairit (1703) | cooncil (1349) | provinces (1128) |
his (16011) | states (3800) | john (2357) | due (1685) | late (1348) | role (1126) |
its (15466) | thay (3741) | nou (2342) | auries (1678) | current (1348) | indie (1122) |
population (12558) | seicont (3697) | island (2342) | line (1661) | offeecial (1343) | heid (1120) |
which (12233) | ceeties (3691) | pairty (2339) | last (1654) | climate (1342) | indwallers (1117) |
ane (12073) | durin (3648) | throu (2332) | five (1643) | held (1341) | border (1115) |
municipality (11629) | januar (3492) | fower (2310) | dividit (1642) | famous (1336) | land (1114) |
aw (10854) | mairch (3440) | accordin (2301) | situatit (1641) | model (1333) | faither (1111) |
toun (10754) | french (3407) | breetish (2289) | airport (1636) | during (1320) | estonie (1109) |
de (10232) | years (3404) | hoose (2284) | fitbawer (1634) | different (1318) | middle (1107) |
aurie (10097) | leid (3377) | based (2274) | auncient (1630) | cheenae (1313) | each (1107) |
first (9997) | main (3320) | include (2267) | athin (1627) | islands (1304) | various (1103) |
kent (9884) | central (3306) | sister (2258) | santa (1614) | pairts (1303) | level (1099) |
province (9804) | la (3300) | group (2258) | wad (1601) | anerlie (1300) | pairlament (1098) |
of (9490) | northren (3297) | metal (2251) | such (1596) | with (1299) | will (1097) |
destrict (9305) | later (3291) | wastren (2247) | uise (1589) | ony (1295) | lik (1097) |
this (8987) | august (3278) | hame (2210) | against (1588) | began (1290) | thegither (1097) |
locatit (8797) | internaitional (3245) | their (2202) | italy (1587) | anither (1287) | battle (1093) |
haed (8745) | unner (3169) | refer (2201) | species (1581) | airms (1286) | japan (1086) |
she (8214) | when (3164) | municipal (2182) | law (1578) | actor (1286) | regional (1077) |
maist (8171) | can (3162) | wha (2168) | el (1574) | spaingie (1278) | produced (1075) |
efter (8021) | julie (3159) | series (2166) | times (1570) | ryal (1275) | industrial (1073) |
born (8016) | october (3124) | place (2154) | james (1565) | saunt (1272) | see (1071) |
name (7913) | film (3116) | daith (2123) | foond (1563) | currently (1270) | haein (1063) |
be (7864) | includin (3103) | team (2115) | heich (1558) | emperor (1268) | few (1063) |
her (7428) | made (3074) | son (2105) | dochter (1557) | plays (1266) | mile (1062) |
pairt (7268) | baith (3065) | alang (2097) | urban (1545) | set (1260) | per (1062) |
and (7262) | to (3059) | roman (2095) | armenie (1545) | schuil (1258) | whilk (1056) |
region (7167) | day (3055) | seat (2092) | age (1542) | marie (1257) | autonomous (1052) |
twa (7070) | oblast (3037) | great (2086) | coast (1538) | leids (1256) | range (1051) |
but (6817) | census (3036) | italian (2083) | than (1531) | order (1252) | cultur (1045) |
ither (6717) | fraunce (3014) | was (2079) | ingland (1527) | namit (1252) | approximately (1043) |
north (6676) | whaur (3003) | station (2063) | pairish (1523) | port (1251) | eften (1041) |
aa (6622) | up (2992) | million (2037) | economy (1522) | referred (1250) | abuin (1038) |
been (6596) | september (2967) | best (2037) | form (1515) | teuk (1248) | veelages (1037) |
hae (6360) | till (2962) | near (2014) | amang (1505) | wird (1247) | asie (1036) |
scots (6261) | juin (2961) | same (2005) | del (1502) | juist (1246) | sae (1034) |
war (6238) | november (2950) | oot (2000) | whan (1499) | diveesion (1246) | ae (1031) |
sooth (6204) | nae (2944) | princess (1999) | white (1497) | kintras (1245) | tho (1027) |
wur (6190) | sea (2888) | released (1963) | production (1492) | govrenorate (1245) | cultural (1025) |
mey (6063) | several (2877) | biggit (1958) | pouer (1489) | aften (1243) | sax (1023) |
intae (5990) | republic (2845) | lairge (1928) | borders (1488) | status (1243) | square (1023) |
caipital (5881) | became (2827) | preses (1925) | fae (1482) | reid (1235) | live (1022) |
state (5624) | bein (2824) | him (1902) | muckle (1481) | left (1234) | actress (1021) |
atween (5549) | follaein (2812) | banner (1898) | common (1472) | led (1231) | airmy (1020) |
new (5507) | kintra (2803) | modren (1894) | william (1468) | them (1226) | offeecially (1018) |
warld (5259) | spain (2801) | by (1894) | grand (1468) | housomeivver (1219) | africae (1017) |
fowk (5209) | municipalities (2768) | aprile (1885) | twin (1459) | tot (1215) | apryle (1017) |
river (5151) | km (2745) | watter (1879) | wan (1458) | means (1211) | uisually (1016) |
aboot (5089) | depairtment (2732) | total (1841) | soviet (1457) | sang (1202) | record (1016) |
coonty (5013) | februar (2717) | european (1840) | named (1455) | term (1201) | david (1015) |
thare (4778) | prince (2705) | life (1828) | cup (1453) | meenister (1198) | vera (1013) |
mony (4732) | inglis (2692) | europe (1824) | metropolitan (1452) | side (1197) | service (1009) |
century (4685) | mexico (2678) | louis (1820) | lang (1451) | economic (1196) | written (1008) |
centre (4633) | govrenment (2672) | muisic (1817) | inhabitants (1450) | there (1194) | includit (1006) |
east (4622) | club (2662) | destricts (1814) | lunnon (1447) | historical (1193) | rock (1004) |
year (4566) | local (2632) | raion (1814) | still (1446) | loch (1191) | simmer (1003) |
who (4564) | major (2608) | king (1803) | creatit (1446) | development (1189) | maistly (1000) |
scotland (4537) | early (2591) | germany (1792) | charles (1438) | bc (1178) | swaden (999) |
thair (4498) | december (2582) | played (1792) | established (1434) | brazil (1177) | average (999) |
uised (4493) | album (2549) | road (1791) | smaw (1434) | days (1173) | al (994) |