"!#"$ %'&(*)+), %'-/.102340.15 36.10879#::<; 9#=.2340>@?::1ACB DE $"!"F GH$ IKJLEH M ON PQ PQR M ON P R M ON P PQ DE $"!#FS GHK$ I"!#"$ T'UWVX'Y#Z[8\8['] ^'ZV`_Ia bzcdy"c e'vfbc ghe'i"vz jlk a']#me^'i"y"znmpoqa']rcd[vkskt u] VZZY"ab avohwa'x^'iys] ] a'] ghe'i"vz wa'x^'iys] ] a'] ghe'i"vzoza']rt'uwvx'y"z[ { JKL}HI ~ SIG R M % R P P ƒ G f ˆ f f!"h$ (h00. %Š>5. A4AŒ =340 = Ž.19 # *=0' y: ; 340> '3CB #34-/B RS Ž Ž B.1AC. 0'3 5=9#B3 R : š R š œ@ Ž Ž B.1AC. % ž= 2 =0 *.13 A4Ÿ.10'0.1 A 340> ' Bd= J# I # f ha6>. O= 2 # /=9 )+.1 h =A A Kªl r«d!" 1 73C=9d9#= 7.1 'Az% : 0 2.> h ' ± -/:10' = ;²9#= =rž ³9#.B =rb #9 '- # '9#=µ9 'AC=dB R AC:1-/.1A=9 9#: 9r9 'AC= B R % ž= 236B ³ H f# G ¹ º#»¹¼yº ¾#»¹»ÁÀàŒÄ ÅÆ»ÇÅÆº" ÈÃɺ#Èû ÈÃÊË»ÅÄŗÌÇÊÆ ¹É1ÄÊ»¹ Œ ¹ ž ÈÃÊ4» Å ¾ ¹ žízîw»ì ÏÆÅv žÄл Ñ»¹»² WÎ Ï˺"º#»¹ W¾ ¹ žº"â» µòôó+óïë»áà+ä ŒÅv»² žïë º"» Õ ŒÄÖ} ¹ÖOÀ»¹ ÐÏË ¹ Ö}»¹ ºØÙ»ÔÌ»¹Ð»¹Êv ¹ÀÃÖO»¹ ºr ¹¾º#» ÈÃÊË»ÅÂrÄлÁÑ»¹»¹ ÑKÄ Åv»Ìˆ ¹ WÊvÄŗÚ È+Ä Ú»Á»¹ Œ ¹ ŒÅÃÏË Wº#» Û Œ ¹ ¹ À+ ¹ ŒÄIÜÝĺ ÄÑKÄ Åv»ÔÞ Ü»¹ÊËÏ4л² ŒÄÑ ÊË»Çß ØQà Ø áø âã Ä žìôº"â»çä žäê4äsåæ» Å+ ¹¾ Ú ŒÄÖ}Öåĺ"ÏËÉ1ÄÊq»¹ Œ ¹ ŒÅ+Þ Ü»¹ÊËÏËл¹ ŒÄÑ Ê4»Çß Ø âø áã"ø Ù»ÔÚ ŒÄÖOÖpÄ ž ÈÃÊË»ÅÉ Ä WÂžÄŗÌÊË»ÇÌ»¹ÉÊvÄ ŒÄº"ÏËл ÉÊvÄÈ+Åv»ÅÎ Ï˺"Â Ä žìôî Ï˺"Âr ¹ÈúÏË Ð»² ÅvÏv ¹ æä žìˆåæèãñk ¹ ŒÌ Ï4 žäº"»áéêväè+åæ» Å ØÙ» Ú ŒÄÖ}ÖåÄ žé1ä Ì»¹º#»¹ÉºrÄ Ú»¹» ÖO»¹ º»¹ Œ ¹ ŒÅÃÏË ž ¹Èà WÀàŒÄ Åv»Åvæ Àà Œ ¹Ñ ÊË»²ÖpÅÃÎWÏ˺#ÂWÉ1Ä ÅÆ» ÏË ž ¹Èà ÀàŒÄ ÅÆ» ÅÆæ»¹ Œ ¹» ¹È+Åл¹ Ñ ÅÆ» çè㻹 É» ÅÆæÄ žìô黲 ºÄÏËȨÅvº# ÈÃɺ"Èà ŒÄÊ»² Œ ¹ ŒÅèØ
Executive Summary In the CORRie parser utility tool GramComp, a preliminary grammar for Swedish have been written to detect agreement in noun phrases, exceptions from these agreement rules, case problems, erroneous verb sequences, and certain structural problems. The grammar contain context-free phrase structure rules and local error rules written in accordance with the formalism presented by Vosse in The Word Connection (1994) and in the documentation to the CORRie parser and GramComp. A special formalism for local error rules has thus not been developed within the SCARRIE project. The development of the grammar rules has been based on test sentences collected from the Error Corpora Database (Deliverable 2.1.3.2). Different types of declarative clause constructions are covered, including formal object and object with infinitive. Subordinate clauses can also be handled.
Contents 1 Introduction...1 2 Grammar rules in CORRie...2 3 Grammar rules for Swedish...3 3.1 Clauses, verb phrases and verb complements...3 3.2 Determiners, adjective phrases, and noun phrases...4 3.3 Adverb phrases, prepositional phrases, and other phrase types...4 4 What to do next...5 Literature...6 Appendices: A Context-free Grammar for Swedish in the CORRie formalism...7 B Test Sentences...15 C Dictionary...19
1 Introduction The objectives were to write context-free phrase structure rules for Swedish, to specify a formalism for local error rules, and to write local error rules. Originally, these three objectives were to form a deliverable each, but the deliverables are joined into one since the tasks have been carried out using the CORRie parser for which a formalism for local error rules already exists. Furthermore, the performance of local error rules and context-free grammar rules are intertwined, and therefore best developed side by side. As a first step, in accordance with the proposed order of priority presented in Three Types of Grammatical Errors in Swedish (Wedbjer Rambell 1998, Deliverable 6.2.3, p. 36), the grammar is to handle: agreement within noun phrases exceptions from agreement rules case problems verb sequences certain structural problems A short presentation of the grammar framework is made in the next chapter. The third chapter is focused on the grammar rules for Swedish, which is presented in full detail in Appendix A. The last chapter present what to focus on next in the future work with the grammar checker. 1
2 Grammar rules in CORRie The preliminary grammar has been developed in the GramComp framework and will be incorporated in the CORRie system at Uppsala university. The grammar consists of context-free phrase structure rules and local error rules written in accordance with the formalism presented by Vosse in The Word Connection (1994) and in the documentation of the CORRie parser and GramComp. The context-free phrase structure rules are built up with categories and features. An example of such a rule is given in (1) below: NP(Number NPspecies Case) -> DetField(Gender Number NPspecies NNspecies) APOpt(Gender Number NPspecies basic Degree) NN(Gender Number NNspecies Case) (1) From an error detection point of view, they can be implicit or explicit about erroneous language usage. The implicit rules detect an error when the features cannot be unified, and the parsing process continues thanks to feature propagation. The explicit rules (or local error rules), on the other hand, are excepted without needing constraint relaxation for continuing the parsing process. If such a local error rule is used in the parsing process, the error expressed in the rule will be signalled as a result of the parsing. If the parser can use the local error rule (2), the message within quotation marks will be presented as a result of the parsing process: FV( ) -> VB(_ [inf sup] _) "Infinite verb should be a finite verb" (2) Features and rules can be given error weights to govern which parse to be presented as the best alternative. The lower the weight, the better is the parsing result. If no weight is given, it is set to 1 as default. In the local error rule (3), the error weight is set to 3. FV( ) -> "Finite verb is missing":3 (3) By assigning the error weight 5 to the feature taking the values [pres pret] in the example below, the parser prefers the local error rule (2) above since the error weight will thus only be 3. FV(Voice Verbtype) -> VB(Voice [pres pret]:5 Verbtype) (4) The design of the grammar involves choosing between the two approaches for different kinds of errors. In principal, structural errors concerning erroneous sequences of categories are described by local error rules, and non-structural errors concerning feature mis-matching are detected by implicit grammar rules. However, non-structural errors may be described by local error rules as well. The two types of rules are closely connected to each other. For instance, insertion of a local error rule may affect the already existing grammar rules. Therefore, the implicit and the explicit grammar rules will be presented together and not separated into different deliverables. 2
3 Grammar rules for Swedish The development of the grammar rules has been based on test sentences collected from the Error Corpora Database (Wedbjer Rambell et al 1998). The test sentences represent error types that should be primarily focused on: agreement problems in noun phrases, case problems, and problems with verb sequences. The sentences show a variety of clause constructions. In connection to this work, lexical information to be stored in the dictionary has been established. The grammar can be found in Appendix A, and the test sentences in Appendix B. The dictionary needed for parsing the test sentences can be found in Appendix C. 3.1 Clauses, verb phrases and verb complements The grammar incorporates declarative clauses with and without inversion. Imperative clauses are also handled, but questions have not been dealt with yet. The declarative clause without inversion contains, in functional terms, a subject, a verb phrase, and an optional subordinate clause: DeklS1 -> NP( [basic subj gen]) VP1 SubSopt (5) The verb phrase VP1 contains a finite verb, an optional clause adverbial, optional infinite verbs, an optional particle, a verb complement, and an optional adverbial: VP1 -> FV([act depon]:3 main) SadvOpt PartOpt Vcomp AdvlOpt (6) There are several VP1 rules to make it possible to handle different types of verb sequences: main finite verb, temporal auxiliary verb + the supine, modal auxiliary verb + the infinitive, and combinations of the auxiliary verbs + main verb. There is also a VP1 rule covering constructions with formal object. The verb complement Vcomp may be empty (for intransitive verbs), it may be a noun phrase (for transitive verb), a personal pronoun + a noun phrase (for bitransitive verbs), or a personal pronoun (for reflexive verbs). It could also be an adjective phrase (functioning as subjective complement), a noun phrase + an adjective phrase (functioning as object and objective complement), an infinitive phrase, a that-clause (att-sats), an adjective phrase + an infinitive phrase, or an adjective phrase + a that-clause. As it is now, verb valency is not used for testing that a verb has been given the correct complements. The verb complements in Vcomp can not be reused in the declarative clauses with inversion in which a verb complement (other than the subject) begins the clause. The Vcomp2 in (7) can be a noun phrase, an adjective phrase, an infinitive phrase, or a that-clause (att-sats). However, the Vcomp are used when the clauses start with an adverbial or a subordinate clause, as in (8). DeklS2 -> Vcomp2 FV(act main) NP( _) SadvOpt PartOpt AdvlOpt (7) DeklS2 -> SubS FV(act main) NP( _) SadvOpt PartOpt Vcomp AdvlOpt (8) There are only one DeklS1 rules but several DeklS2 rules, since different constituents can be put in the first position. Complements in prepositional phrases may stand first: DeklS2 -> NP( [basic obj]) FV(act main) NP( _) SadvOpt Vcomp AdvlOpt PR (9) 3
Subordinate clauses that can be handled by the preliminary grammar are that-clauses (attsatser), relative clauses, and other subordinate clauses. There are two types of relative clauses depending on whether the relative pronoun functions as the subject or the object of the relative clause. In these subordinate clauses, the clause adverbial precedes the finite verb, why new verb phrase rules must be introduced. An example of such a rule is given in (10). VP3 -> SadvOpt FV(act:3 main) PartOpt Vcomp AdvlOpt (10) 3.2 Determiners, adjective phrases, and noun phrases The noun phrase may contain a determiner field, an adjective phrase, and a noun phrase, (see example rule 1 above). After the head noun, a participle phrase or a relative clause may follow. When designing rules to cover these options, there are several restrictions and exceptions to consider. The sequence of determiners is quite strict. It can be divided into six determiner types appearing in a certain order: total, determinative, possessive, quantitative, selective, comparative (Loman 1956, Thorell 1974 p. 52). Furthermore, there are two types of determinative determiners: those demanding the head noun to be in the definite form, and those demanding the head noun to be in the indefinite form. The former determinative type and the determiners expressing totality form together Det1. The latter determinative type and the possessive determiners form Det2, which makes the head noun to be indefinite. Finally, the last three determiner types form Det3. The adjective phrase rules have been designed to cover the problems present in the test sentences, and are therefore not so general. To detect erroneous word category of the premodifier (e.g. an adjective should be changed into an adverb), missing comma is used as a signal of the problem. The noun phrase rules cover the species agreement limitations that are invoked by the determiners. Furthermore, case problems are handled, and so are agreement problems with appositions. A tentative approach has been made towards article usage in noun phrases with adjective attribute, and in elliptic noun phrases. 3.3 Adverb phrases, prepositional phrases, and other phrase types The optional adverbial used in the clause rules (e.g. rule example 6 above), may consist of an adverb phrase and/or a prepositional phrase. The clause adverbial may also be a prepositional phrase, since PPs may take that position in the clause. A preposition may take a noun phrase or an infinitive phrase as its complement. Usage of a conjunction instead of a preposition is captured in a local error rule. Punctuation errors may be detected by local error rules. One such rule has been written to handle missing commas in coordination of main clauses. 4
4 What to do next The rules for adjective phrases must be rewritten in order to make them more general and accurate. The noun phrase rules may also be revised. Possible positions for comma signs should be marked in the rules. The next main error types to incorporate in the grammar, according to the proposed order of priority (Wedbjer Rambell 1998, p 36) is: verb valency agreement between NP (subject) and AP (subjective complement) noun valency adjective valency pronoun case 5
Literature Ahrenberg, Lars (1994): Formell grammatik för naturligt språk. Linköping University, Department of Computer and Information Science. Linköping. Ljung, Magnus & Ohlander, Sölve (1982). Allmän grammatik. Stockholm. Loman, Bengt (1956): Om relationen mellan ordföljd och betydelse hos framförställda attributiva bestämningar till substantiviska huvudord. In: Arkiv för nordisk filologi 71, p. 218-244. Teleman, Ulf (1974): Manual för grammatisk beskrivning av talad och skriven svenska. Lund. Thorell, Olof (1973): Svensk grammatik. Stockholm. Wedbjer Rambell et al. (1998). An Error Database of Swedish. SCARRIE, Deliverable 2.1.3.2, version 1.0. Uppsala University, Department of Linguistics. Wedbjer Rambell, Olga (1998): Three Types of Grammatical Errors in Swedish. SCAR- RIE, Deliverable 6.2.3, version 1.0. Uppsala University, Department of Linguistics. Vosse, Theo (1994). The Word Connection. Grammar-based Spelling Error Correction in Dutch. Amsterdam. 6
Appendix A: Context-free Grammar for Swedish in the CORRie formalism -- types GenderType = utr neutr NumberType = sg pl SpeciesType = indef def DefType = indef def DegreeType = pos comp sup CaseType = basic gen subj obj PronFuncType = själv fören DettypeType = tot det1 det2 poss quant select comp -- while det1 unifies with the head noun and det2 has separate conditions NumtypeType = card ord VoiceType = act pass depon VerbformType = inf pres pret imp sup VerbtypeType = main temp mod cop ParttypeType = pres perf SadvType = yes no SNtypeType = att other -- variables Gender @ GenderType Gender1 @ GenderType Gender2 @ GenderType Number @ NumberType Number1 @ NumberType Number2 @ NumberType Species @ SpeciesType Species1 @ SpeciesType NPspecies @ SpeciesType NNspecies @ SpeciesType Noundef @ SpeciesType Form @ SpeciesType Form1 @ SpeciesType Form2 @ SpeciesType Def @ DefType Def1 @ DefType Degree @ DegreeType Case @ CaseType Case1 @ CaseType Case2 @ CaseType PronFunc @ PronFuncType Dettype @ DettypeType Numtype @ NumtypeType Voice @ VoiceType Verbtype @ VerbtypeType Parttype @ ParttypeType Sadv @ SadvType SNtype @ SNtypeType -- non-terminals S DeklS DeklS1 DeklS2 DeklS3 QuestS ImpS SubS 7
ThatS RelS(GenderType NumberType) RelS2(GenderType NumberType) RelSOpt(GenderType NumberType) NP(NumberType SpeciesType CaseType) App(GenderType NumberType CaseType) PmP(GenderType NumberType CaseType) AP(GenderType NumberType SpeciesType CaseType DegreeType) APOpt(GenderType NumberType SpeciesType CaseType DegreeType) AVPopt(GenderType NumberType SpeciesType CaseType DegreeType) Det(DettypeType GenderType NumberType SpeciesType) DetOpt(DettypeType GenderType NumberType SpeciesType) Det1(GenderType NumberType SpeciesType) Det1Opt(GenderType NumberType SpeciesType) Det2(GenderType NumberType SpeciesType) Det3(GenderType NumberType SpeciesType) Det3Opt(GenderType NumberType SpeciesType) DetField(GenderType NumberType SpeciesType SpeciesType) DetFieldOpt(GenderType NumberType SpeciesType SpeciesType) VP VP1 VP3 FV(VoiceType VerbtypeType) IV(VerbformType VerbtypeType) IVmodOpt VBopt(VoiceType VerbformType VerbtypeType) PartOpt IP IVP Vcomp AbP AbPOpt SadvOpt Advl AdvlOpt PP PPOpt CommaObl -- terminals NN(GenderType NumberType SpeciesType CaseType) -- nouns PM(GenderType CaseType) -- proper nouns AV(GenderType NumberType SpeciesType CaseType DegreeType) -- adjectives PN(GenderType NumberType CaseType) -- personal pronouns PS(GenderType NumberType) -- possessive pronouns PE(CaseType) -- reciprocal pronouns PV(GenderType NumberType CaseType) -- relative pronouns PI(GenderType NumberType CaseType PronFuncType) -- interrogative pronouns PD(DettypeType SpeciesType GenderType NumberType CaseType) -- demonstrative & indefinite pronouns AL(GenderType NumberType SpeciesType) -- articles NL(NumtypeType NumberType CaseType) -- numerals VB(VoiceType VerbformType VerbtypeType) -- verbs PC(ParttypeType GenderType NumberType SpeciesType CaseType) -- participles AB(SadvType) -- adverbs PR -- prepositions CN -- coordinating conjunctions 8
SN(SNtypeType) -- subordinating conjunctions IE -- infinitive mark Punc -- end of sentence punctuations marks Quest -- question mark Comma -- comma -- RULES -- Clauses & Sentences S -> DeklS Punc S -> QuestS Quest S -> ImpS Punc -- Declarative clauses: DeklS -> DeklS1 DeklS -> DeklS2 DeklS -> DeklS CommaObl CN DeklS -- coordination of main clauses with obligatory comma DeklS -> CN DeklS1 DeklS -> CN DeklS2 -- Clause beginning with the subject DeklS1 -> NP( [basic subj gen]) VP1 SubSopt -- pasive voice DeklS1 -> NP( _) FV(pass main) SadvOpt AdvlOpt -- verb phrases in clauses beginning with the subject: VP1 -> FV([act depon]:3 main) SadvOpt PartOpt Vcomp AdvlOpt -- main verb in finite form VP1 -> FV(act temp) SadvOpt VB(_ sup main) PartOpt Vcomp AdvlOpt -- perfect VP1 -> FV(act temp) SadvOpt VB(_ sup mod) IV(inf main) PartOpt Vcomp AdvlOpt -- perfect with modal verb(s) VP1 -> FV(act mod:2) SadvOpt IV(inf main) PartOpt Vcomp AdvlOpt -- modal verb(s) VP1 -> FV(act cop) SadvOpt PC(perf _ basic) AdvlOpt -- copula VP1 -> SadvOpt VB(_ sup main) PartOpt Vcomp AdvlOpt "Missing temporal auxiliary verb" -- rules for handling clauses with formal object (a specific case in which the verb takes object and objective complement): VP1 -> FV([act depon]:3 main) PN(neutr sg subj) SadvOpt AP( _) ThatS FV(Voice Verbtype) -> VB(Voice [pres pret]:5 Verbtype) FV( ) -> VB(_ [inf sup] _) "Infinite verb should be a finite verb" FV( ) -> VB(_ imp _) "Change to presens or preteritum" FV( ) -> "Finite verb is missing":3 IV(inf main) -> IVmodOpt VB(_ inf main) IV(inf main) -> IVmodOpt VBopt(act inf temp) VB(_ sup main) IVmodOpt -> VB(act inf mod) IVmodOpt IVmodOpt -> VBopt(act inf temp) VB(act sup mod) IVmodOpt -- the infinitive of the temporal auxiliary is optional IVmodOpt -> VBopt(act inf temp) -> VB(act inf temp:3) VBopt( _) -> Vcomp -> -- intransitive verbs Vcomp -> NP( _) -- transitive verbs Vcomp -> IndObj NP( _) -- bitransitive verbs Vcomp -> PN( obj) -- reflexive verbs 9
Vcomp -> AP( _) -- verb with subjective complement Vcomp -> NP( _) AP( _) -- verb with direct object and objective complement Vcomp -> IP -- infinitive phrase Vcomp -> NP( [basic obj]) IVP -- object with infinitive Vcomp -> AP( _) IP -- adjective phrase plus infinitive phrase Vcomp -> ThatS -- that-clause ("att-sats") Vcomp -> AP( _) ThatS -- adjective phrase plus that-clause IP -> IE SadvOpt VB([act depon] inf main) PartOpt Vcomp IP -> IE SadvOpt FV( ) VB([act depon] inf main) PartOpt Vcomp "Remove the finite verb" IVP -> VB([act depon] inf:2 main) PartOpt Vcomp IndObj -> PN( obj:3) IndObj -> PmP( _) -- Sentences with inversion -- the finite verb precedes the subject because another constituent than the subject in first position -- clauses beginning with verb complement (object etc) DeklS2 -> Vcomp2 FV(act main) NP( _) SadvOpt PartOpt AdvlOpt -- main verb in finite form DeklS2 -> Vcomp2 FV(act temp) NP( _) SadvOpt VB(_ sup main) PartOpt AdvlOpt -- perfect DeklS2 -> Vcomp2 FV(act temp) NP( _) SadvOpt VB(_ sup mod) IV(inf main) PartOpt AdvlOpt -- perfect with modal verb(s) DeklS2 -> Vcomp2 FV(act mod) NP( _) SadvOpt IV(inf main) PartOpt AdvlOpt -- modal verb(s) Vcomp2 -> NP( _) -- transitive verbs Vcomp2 -> AP( _) -- verb with subjective complement Vcomp2 -> IP -- infinitive phrase Vcomp2 -> ThatS -- that-clause ("att-sats") -- rules to handle object with infinitive in inverted clauses DeklS2 -> NP( [basic obj]) FV(act main) NP( _) SadvOpt IVP AdvlOpt DeklS2 -> NP( [basic obj]) FV(act temp) NP( _) SadvOpt VB(_ sup main) IVP AdvlOpt DeklS2 -> NP( [basic obj]) FV(act mod) NP( _) SadvOpt VB(_ inf main) IVP AdvlOpt -- the complement of the prepositional phrase stands in the first position: DeklS2 -> NP( [basic obj]) FV(act main) NP( _) SadvOpt Vcomp AdvlOpt PR DeklS2 -> NP( [basic obj]) FV(act temp) NP( _) SadvOpt VB(_ sup main) Vcomp AdvlOpt PR DeklS2 -> NP( [basic obj]) FV(act mod) NP( _) SadvOpt VB(_ inf main) Vcomp AdvlOpt PR -- clauses beginning with adverbial: DeklS2 -> Advl FV(act main) NP( _) SadvOpt PartOpt Vcomp AdvlOpt DeklS2 -> Advl FV(act temp) NP( _) SadvOpt VB(_ sup main) PartOpt Vcomp AdvlOpt DeklS2 -> Advl FV(act temp) NP( _) SadvOpt VB(_ sup main) IV(inf main) PartOpt Vcomp AdvlOpt DeklS2 -> Advl FV(act mod) NP( _) SadvOpt IV(inf main) PartOpt Vcomp AdvlOpt -- clauses beginning with a subordinate clause: DeklS2 -> SubS FV(act main) NP( _) SadvOpt PartOpt Vcomp AdvlOpt DeklS2 -> SubS FV(act temp) NP( _) SadvOpt VB(_ sup main) PartOpt Vcomp AdvlOpt DeklS2 -> SubS FV(act temp) NP( _) SadvOpt VB(_ sup main) IV(inf main) PartOpt Vcomp AdvlOpt DeklS2 -> SubS FV(act mod) NP( _) SadvOpt IV(inf main) PartOpt Vcomp AdvlOpt -- Imperative clause ImpS -> VB([act depon] imp main) SadvOpt PartOpt Vcomp AdvlOpt -- Subordinated clauses -- the clause adverbial ("satsadverbialet") precedes the finite verb ("BIFF-regeln") ThatS -> SN(att:3) DeklS3 -- "att-sats" 10
SubS -> SN(other) DeklS3 -- other subordinate clauses SubSopt -> SubSopt -> SubS RelS(Gender Number) -> PV(Gender Number _) VP3 RelS(Gender Number) -> PV(Gender Number _) DeklS3 RelS(Gender Number) -> VP3 "missing relative pronoun":3 -- relative clauses with optional relative pronoun: RelS2(Gender Number) -> RelS(Gender Number) RelS2(Gender Number) -> NP( _) VP3 RelSOpt( ) -> RelSOpt(Gender Number) -> RelS(Gender Number) DeklS3 -> NP( _) VP3 VP3 -> SadvOpt FV(act:3 main) PartOpt Vcomp AdvlOpt -- main verb in finite form and active voice VP3 -> SadvOpt FV(pass main) AdvlOpt -- main verb in finite form and passive voice VP3 -> SadvOpt FV(act temp) VB(_ sup main) PartOpt Vcomp AdvlOpt -- perfect VP3 -> SadvOpt VB(_ sup:2 main) PartOpt Vcomp AdvlOpt -- dito without temporal auxiliary VP3 -> SadvOpt FV(act temp) VB(_ sup mod) IV(inf main) PartOpt Vcomp AdvlOpt -- perfect with modal verb(s) VP3 -> SadvOpt VB(_ sup:2 mod) IV(inf main) PartOpt Vcomp AdvlOpt -- dito without temporal auxiliary VP3 -> SadvOpt FV(act mod) IV(inf main) PartOpt Vcomp AdvlOpt -- modal verb(s) VP3 -> SadvOpt FV(act cop) PC(perf _ basic) AdvlOpt -- copula -- NP rules: -- Pronouns NP(Number def Case) -> PN(_ Number Case) -- personal pronoun NP(Number def Case) -> PN(_ Number Case) RelS(Gender Number) -- dito with relative clause NP(Number def basic) -> PS(_ Number) -- possessive pronoun NP(Number def obj) -> PE(basic) -- reciprocal pronoun NP(Number def Case) -> PI(_ Number Case själv) -- interrogative pronoun NP(Number Species Case) -> PD(_ Species _ Number Case) -- demonstrative or indefinite pronoun NP(Number Species Case) -> PD(_ Species _ Number Case) RelS(Gender Number) -- dito with relative clause -- Proper noun NP(Number def Case) -> PmP(Gender Number Case) -- Common noun NP(Number Species Case) -> NN(Gender Number Species Case) -- Noun phrase with determiner and optional adjective phrase: NP(Number NPspecies Case) -> DetField(Gender Number NPspecies NNspecies) APOpt(Gender Number NPspecies basic Degree) NN(Gender Number NNspecies Case) -- dito with necessary relative clause: NP(Number NPspecies Case) -> DetField(Gender Number NPspecies NNspecies) APOpt(Gender Number NPspecies basic Degree) NN(Gender Number indef Case) RelS2(Gender Number) -- Noun phrase without determiner but with adjective phrase and optional relative clause -- participles ought to be preceded by a determiner in the singular (?) NP(pl indef Case) -> AP(Gender pl indef:2 basic Degree) NN(Gender pl indef:2 Case) RelSOpt(Gender pl) NP(pl def Case) -> AP(Gender pl def basic Degree) NN(Gender pl def Case) RelSOpt(Gender pl) "":3 -- many false alarms? NP(sg Species Case) -> AVPopt(Gender sg:2 Species basic Degree) NN(Gender sg:2 Species Case) RelSOpt(Gender sg) 11
-- Noun with genitive attribute (noun or proper noun): NP(Number def Case) -> NP( gen:3) APOpt(Gender Number def basic Degree) NN(Gender Number indef Case) NP(Number def Case) -> PM(_ gen) APOpt(Gender Number def basic Degree) NN(Gender Number indef Case) -- Noun phrases with adverb + numeral (could be saved as multi-word expressions and be given a special phrase category): NP(Number indef Case) -> AbP NL(card pl basic) NN(Gender Number indef Case) NP(Number Species Case) -> AP(Gender Number Species Case Degree) NL(card pl basic) NN(Gender Number Species Case) "erroneous adjective before the numeral":3 -- Elliptic noun phrases NP(Number def Case) -> DetField(Gender Number def def) AP(Gender Number def Case Degree) -- Coordination of NP:s NP(Number Species Case) -> NP(Number Species Case) CN NP(Number Species Case) -- Noun with apposition: NP(Number Species Case) -> NP(Number Species Case) App(Gender Number basic:3) App(Gender Number Case) -> PmP(Gender Number Case) -- Proper noun phrases PmP(Gender sg Case) -> PM(Gender Case) PmP(Gender sg Case) -> PM(Gender Case) PM(Gender Case) PmP(Gender pl Case) -> PM(Gender Case) CN PM(Gender Case) -- Noun phrase with prepositional phrase NP(Number Species Case) -> NP(Number Species Case) PP -- Noun phrase with infinitive phrase NP(Number Species Case) -> NP(Number Species Case) IP -- Noun phrase with shortened clause NP(Number Species Case) -> NP(Number Species Case) PC(pres _ basic) NP( _) -- Determiner rules: --Possessive determiners: Det(poss Gender Number def) -> PS(Gender Number) -- possessive pronoun Det(poss Gender Number def) -> PI(Gender Number gen fören) -- interrogative pronoun in genitive case Det(poss Gender Number def) -> PD(Dettype Species Gender Number gen:3) -- dem & indef pronoun in the genitive case -- Other determiners: Det(select Gender Number indef) -> PI(Gender Number basic fören) -- interrogative pronoun in basic case Det(tot Gender Number Species) -> PD(tot Species Gender Number basic) -- dem & indef pronoun in the basic case Det(Dettype Gender Number Species) -> PD(Dettype Species Gender Number basic) -- dem & indef pronoun in the basic case Det(det1 Gender Number def) -> AL(Gender Number def) Det(quant Gender Number indef) -> AL(Gender Number indef) Det(quant Gender Number indef) -> NL(card Number basic) Det(select Gender Number def) -> NL(ord Number basic) DetOpt( ) -> DetOpt(Dettype Gender Number Species) -> Det(Dettype Gender Number Species) DetOpt(Dettype Gender Number Species) -- determiners tot & det1: det1 governs the species of the head noun if det1 is present Det1(Gender Number Species) -> Det(tot:3 Gender Number Species) 12
Det1(Gender Number Species) -> DetOpt(tot Gender Number Species1) Det(det1:3 Gender Number Species) Det1Opt( _) -> Det1Opt(Gender Number Species) -> Det1(Gender Number Species) -- determiners det2 & poss: Det2(Gender Number Species) -> Det(det2:3 Gender Number Species) DetOpt(poss Gender Number Species) Det2(Gender Number Species) -> DetOpt(det2 Gender Number Species) Det(poss:3 Gender Number Species) -- determiners quant, select & comp: Det3(Gender Number Species) -> Det(quant:3 Gender Number Species) DetOpt(quant Gender Number Species) DetOpt(select Gender Number Species) DetOpt(comp Gender Number Species) Det3(Gender Number Species) -> DetOpt(quant Gender Number Species) Det(select:3 Gender Number Species) DetOpt(select Gender Number Species) DetOpt(comp Gender Number Species) Det3(Gender Number Species) -> DetOpt(quant Gender Number Species) DetOpt(select Gender Number Species) Det(comp:3 Gender Number Species) DetOpt(comp Gender Number Species) Det3Opt( _) -> Det3Opt(Gender Number Species) -> Det3(Gender Number Species) -- Rules for the whole determiner field: -- determiners det2 & poss affects the species of the head noun (it should be in the indefinite form): DetField(Gender Number Species indef) -> Det1Opt(Gender Number Species1) Det2(Gender Number Species) Det3Opt(Gender Number Species1) -- otherwise, the first determiner determines what the species of the NP should be: DetField(Gender Number Species Species) -> Det1(Gender Number Species) Det3Opt(Gender Number Species1) DetField(Gender Number Species Species) -> Det3(Gender Number Species) DetFieldOpt( ) -> DetFieldOpt(Gender Number Species Species1) -> DetField(Gender Number Species Species1) -- AP rules: APOpt( _) -> APOpt(Gender Number Species Case Degree) -> AP(Gender Number Species Case Degree) AVPopt(Gender Number Species Case Degree) -> AbPOpt AV(Gender Number Species Case Degree) AVPopt(Gender Number Species Case Degree) AVPopt(Gender Number Species Case Degree) -> -- single adjective: AP(Gender Number Species Case Degree) -> AV(Gender Number Species Case Degree) -- single participle: AP(Gender Number Species Case pos) -> PC(Parttype Gender Number Species Case) -- coordinated adjectives: AP(Gender Number Species Case Degree) -> AV(Gender Number Species Case Degree) CN AV(Gender Number Species Case Degree) -- with adverb as premodifier: AP(Gender Number Species Case Degree) -> AbP AV(Gender Number Species Case Degree) AP(Gender Number Species Case pos) -> AbP PC(Parttype Gender Number Species Case) -- with erroneous adjective as premodifiers or missing comma: AP(Gender Number Species Case Degree) -> AV(Gender Number Species Case Degree) AV(Gender Number Species Case Degree) "the first adjective: ought to be an adverb or a comma ought to be inserted after it":2 13
AP(Gender Number Species Case Degree) -> AV(Gender Number Species Case Degree) PC(Parttype Gender Number Species Case) "the first adjective: ought to be an adverb or a comma ought to be inserted after it":2 -- participle followed by an adjective: AP(Gender Number Species Case Degree) -> PC(Parttype Gender Number Species Case) AV(Gender Number Species Case Degree) -- AbP rules: AbP -> AB(_) AbP -> AB(_) AbP AbPOpt -> AbP AbPOpt -> SadvOpt -> AB(yes) SadvOpt -> AbP AB(yes) SadvOpt -> PP SadvOpt -> NL(card _ basic) SadvOpt -> Advl -> AbP AdvlOpt Advl -> PP AdvlOpt AdvlOpt -> AdvlOpt -> Advl PartOpt -> PartOpt -> AB(no) -- PP rules PP -> PR NP( [basic obj]) PP -> PR IP PP -> SN(other) NP( _) "change the conjunction into a preposition" PPOpt -> PP PPOpt -> -- Rules for punctuation marks CommaObl -> Comma CommaObl -> "Comma missing":2 14
Appendix B: Test Sentences The test sentences consist of a correct sentence and an incorrect one (marked with *). The language errors are collected from the Error Corpora Database, and formed into full sentences if only a sentence fragment has been stored in the database. -- TESTs of GPNPAG (agreement problems in noun phrases) de områden som tror tror. *de områdena som tror tror. de slutgiltiga siffrorna existerar. *det slutgiltiga siffrorna existerar. de grymma djurtransporterna existerar. *det grymma djurtransporterna existerar. de första medlingarna existerar. *det första medlingarna existerar. de större maskinerna och traktorerna existerar. *det större maskinerna och traktorerna existerar. de många mörka vintertimmarnas slit existerar. *det många mörka vintertimmarnas slit existerar. de kanske mest personliga områden vi ser existerar. *det kanske mest personliga områden vi ser existerar. det tidiga 1800-talet existerar. *det tidiga 1800-talen existerar. det slutgiltiga beskedet existerar. *det slutgiltiga beskeden existerar. den_här soppan existerar. *den_här soppar existerar. de ser sina skuggor. *de ser sin skuggor. han ser sin företrädare Anders Björk. *han ser sina företrädare Anders Björk. faktiska och praktiska möjligheter att tro existerar. *faktiskt och praktiska möjligheter att tro existerar. det är vart tredje år. *det är var tredje år. en kristen radiostation existerar. *ett kristet radiostation existerar. det är en viktig behandlingsmöjlighet för dem. *det är en viktig behandlingsmöjligheten för dem. det är kyrkans officiella ordning. *det är kyrkans officiell ordning. den förre statsministern Carl Bildt var där. *den förre statsminister Carl Bildt var där. Aftonbladets ledarsida drabbades i helgen av akut sjukdomsinsikt beträffande svenska ekonomin och arbetsmarknaden. 15
*Aftonbladets ledarsida drabbades i helgen av akut sjukdomsinsikt beträffande svenska ekonomi och arbetsmarknad. ungdomen har i betydligt större utsträckning accepterat det. *ungdomen har i betydligt utsträckning än övriga accepterat det. -- TESTs of GPNPCA (case problems in noun phrases) olika sorters människor existerar. *olika sorter människor existerar. det gynnar konsumenternas intressen. *det gynnar konsumenterna intressen. Tierps kommun stöder sig på detta. *Tierp kommun stöder sig på detta. det sade Moderaternas ekonomiske talesman. *det sade Moderaterna ekonomiske talesman. Roger Pettersson försvarar sin doktorsavhandling. *Roger Petterssons försvarar sin doktorsavhandling. tredje rullbanan på Arlanda tas i bruk i_dag. *tredje rullbanan på Arlandas tas i bruk i_dag. TESTs of GPNPAP01 (word category of premodifier of the adjective) nästan hundra procent är bra. *nästa hundra procent är bra. en riktigt uppkäftig korvsenap på Düsseldorf kan panelen också ordinera. *en riktig uppkäftig korvsenap på Düsseldorf kan panelen också ordinera. det är en kraftigt ökad alkoholkonsumtion. *det är en kraftig ökad alkoholkonsumtion. TESTs of GPNPPN (pronoun problems in the noun phrase) det är en opinion som aldrig behövt ta ett reellt politiskt ansvar. *det är en opinion aldrig behövt ta ett reellt politiskt ansvar. tävlingen som riktar sig enbart till kvinnliga kockar instiftades för att förnya köket. *tävlingen riktar sig enbart till kvinnliga kockar instiftades för att förnya köket. TEST of GPPPPR05 (wrong word category; not a preposition) beslut väntas före sommaren. *beslut väntas innan sommaren. TESTs of GPVFMF01-03 (doubled main verb in the finite form) de präglas alla av detta. *de är präglas alla av detta. det bytte dock namn i samband med den första privatiseringen under Thatcherepoken. *det blev bytte dock namn. polisen hörde flera vittnen under kvällen och utredningen kommer att fortsätta under tisdagen. *polisen har hörde flera vittnen under kvällen. TESTs of GPVFMF04-09 (infinite verb instead of infinite verb) 16
lagens långa arm fäste dock inget avseende vid detta. *lagens långa arm fästa dock inget avseende vid detta. de avskedade kvinnorna får rådet att starta eget. *de avskedade kvinnorna få rådet att starta eget. -- erroneosuly accepted by the parser as an object with infinitive han blir den mittfältsstrateg som Gimo saknat. *han bli den mittfältsstrateg som Gimo saknat. betänk också de anläggningskostnader som tillkommer. *betänkt också de anläggningskostnader som tillkommer. det blir sång och musik. *det bli sång och musik. det finns många som använder namnet. *det finns många som använda namnet. risken finns att det inte blir något. *risken finns att det inte bli något. TESTs of GPVFTS01-02, 08 (erroneous infinite form) men nu har hoppet tänts igen. *men nu har hoppet tänds igen. vi har inte kvantifierat det, men det är klart att det får ekonomiska konsekvenser. *vi har inte kvantifiera det, men det är klart att det får ekonomiska konsekvenser. Carl har skrivit brev till Anders. *Carl har skrivet brev till Anders. TESTs of GPVFTS05 (missing auxiliary verb) men det har kommit in jordbakterier i maten. *men det kommit in jordbakterier i maten. vi har lärt oss prioritera om när det gäller städningen. *vi lärt oss prioritera om när det gäller städningen. TESTs of GPVFAI01, 05 (infinite verb instead of finite verb) om människor börjar tro på en förändring är det bra. *om människor börja tro på en förändring är det bra. ett gott skratt kan göra den suraste på gott humör. *ett gott skratt kana göra den suraste på gott humör. TESTs of GPVFAM02-04, 11 (finite verb instead of the infinitive) det låter jag vara osagt. *det låter jag var osagt. där kan den framväxande nätkulturen vara början till intresset. *där kan den framväxande nätkulturen var början till intresset. -- the parser erroneosulr accepts the sentencence: -- "var" is interpreted as a determiner tåg ska rulla. *tåg ska rullar. 17
i dammarna brukar det finnas änder. *i dammarna brukar det finns änder. det får man ha förståelse för. *det får man har förståelse för. domstolen fann det bevisat att Markus Wolf 1962 lät kidnappa den avhoppade östtyske Stasiagenten. *domstolen fann det bevisat att Markus Wolf 1962 lät kidnappade den avhoppade östtyske Stasiagenten. TESTs of GPVFAM (combination of auxiliary verbs + main verb) de kan väl inte ha varit så dumma. *de kan väl inte har varit så dumma. för säkerhets skull har församlingen redan låtit montera ned ett antal störande mursegment. *för säkerhets skull har församlingen redan låt montera ned ett antal störande mursegment. TESTs of GPVFIP (infinitive in infinitive phrase) de boende har störst möjlighet att framföra sina åsikter. *de boende har störst möjlighet att framför sina åsikter. han har lovat att i alla fall slå Turkiet. *han har lovat att i alla fall skall slå Turkiet. 18
Appendix C: Dictionary The dictionary used for testing the sentences in Appendix B are presented below. The words are listed after word category in the following order: nouns, proper nouns, adjectives, pronouns, numerals, verbs, participles, adverbs, prepositions, conjunctions, and punctuation marks. 1800-talet NN(neutr sg def basic) 1800-talen NN(neutr pl def basic) alkoholkonsumtion NN(utr sg indef basic) anläggningskostnader NN(utr pl indef basic) ansvar NN(neutr sg indef basic) antal NN(neutr sg indef basic) arbetsmarknad NN(utr sg indef basic) arbetsmarknaden NN(utr sg def basic) arm NN(utr sg indef basic) avseende NN(neutr sg indef basic) banor NN(utr pl indef basic) behandlingsmöjlighet NN(utr sg indef basic) behandlingsmöjligheten NN(utr sg def basic) beskedet NN(neutr sg def basic) beskeden NN(neutr pl def basic) beslut NN(neutr _ indef basic) besvären NN(utr pl def basic) biskop NN(utr sg indef basic) biskopen NN(utr sg def basic) brev NN(neutr _ indef basic) bruk NN(neutr sg indef basic) början NN(utr sg indef basic) dammarna NN(utr pl def basic) djurtransporterna NN(utr pl def basic) doktorsavhandling NN(utr sg indef basic) domstolen NN(utr sg def basic) ekonomi NN(utr sg indef basic) ekonomin NN(utr sg def basic) fall NN(neutr _ indef basic) frigolit NN(utr sg indef basic) företrädare NN(utr _ indef basic) församlingen NN(utr sg indef basic) förståelse NN(utr sg indef basic) förändring NN(utr sg indef basic) helgen NN(utr sg def basic) hoppet NN(neutr sg def basic) humör NN(neutr sg indef basic) intressen NN(neutr pl indef basic) intresset NN(neutr sg def basic) jordbakterier NN(utr pl indef basic) kana NN(utr sg indef basic) kockar NN(utr pl indef basic) kommun NN(utr sg indef basic) konsekvenser NN(utr pl indef basic) konsumenterna NN(utr pl def basic) konsumenternas NN(utr pl def gen) korvsenap NN(utr sg indef basic) kyrkans NN(utr sg def gen) kvinnorna NN(utr pl def basic) kvällen NN(utr sg def basic) köket NN(neutr sg def basic) lagens NN(utr sg def gen) leader NN(utr sg indef basic) ledarsida NN(utr sg indef basic) lår NN(neutr _ indef basic) maskinerna NN(utr pl def basic) maten NN(utr sg def basic) 19
medlingarna NN(utr pl def basic) mittfältsstrateg NN(utr sg indef basic) mursegment NN(neutr sg indef basic) musik NN(utr sg indef basic) människa NN(utr sg indef basic) människor NN(utr pl indef basic) möjlighet NN(utr sg indef basic) möjligheter NN(utr pl indef basic) namn NN(neutr _ indef basic) namnet NN(neutr sg def basic) nätkulturen NN(utr sg def basic) områden NN(neutr pl indef basic) områdena NN(neutr pl def basic) opinion NN(utr sg indef basic) ordning NN(utr sg indef basic) panelen NN(utr sg def basic) polisen NN(utr sg def basic) privatiseringen NN(utr sg def basic) procent NN(utr _ indef basic) radiostation NN(utr sg indef basic) risken NN(utr sg def basic) rullbanan NN(utr sg def basic) rumpa NN(utr sg indef basic) rådet NN(neutr sg def basic) samband NN(neutr _ indef basic) siffrorna NN(utr pl def basic) sjukdomsinsikt NN(utr sg indef basic) skall NN(neutr sg indef basic) skratt NN(neutr sg indef basic) skuggor NN(utr pl indef basic) skull NN(utr sg indef basic) slit NN(neutr _ indef basic) sommaren NN(utr sg def basic) soppan NN(utr sg def basic) soppar NN(utr pl indef basic) sorter NN(utr pl indef basic) sorters NN(utr pl indef gen) Stasiagenten NN(utr sg def basic) statsminister NN(utr sg indef basic) statsministern NN(utr sg def basic) städningen NN(utr sg def basic) sång NN(utr sg indef basic) säkerhets NN(utr sg indef gen) talesman NN(utr sg indef basic) taxi NN(utr _ indef basic) Thatcherepoken NN(utr sg def basic) tisdagen NN(utr sg def basic) traktorerna NN(utr pl def basic) tåg NN(neutr _ indef basic) tävlingen NN(utr sg def basic) ungdomen NN(utr sg def basic) utredningen NN(utr sg def basic) utsträckning NN(utr sg indef basic) vintertimmarnas NN(utr pl def gen) vittnen NN(neutr pl indef basic) vänskap NN(utr sg indef basic) år NN(neutr _ indef basic) åsikter NN(utr pl indef basic) änder NN(utr pl indef basic) Aftonbladets PM(neutr gen) Anders PM(utr basic) Arlanda PM(utr basic) Arlandas PM(utr gen) Bildt PM(utr basic) Björk PM(utr basic) Carl PM(utr basic) Düsseldorf PM(utr basic) Gimo PM(utr basic) 20
Markus PM(utr basic) Moderaterna PM(utr basic) Moderaternas PM(utr gen) Roger PM(utr basic) Pettersson PM(utr basic) Petterssons PM(utr gen) Tierp PM(utr basic) Tierps PM(utr gen) Turkiet PM(neutr basic) UNT:s PM(utr gen) Wolf PM(utr basic) akut AV(_ sg indef basic pos) betydligt AV(neutr sg indef basic pos) AB(no) bra AV( _ basic pos) dumma AV(_ sg def basic pos) AV(_ pl _ basic pos) dyrt AV(neutr sg indef basic pos) AB(no) eget AV(neutr sg indef basic pos) ekonomiska AV(_ sg def basic pos) AV(_ pl _ basic pos) ekonomiske AV(utr sg def basic pos) faktiskt AV(neutr sg indef basic pos) AB(_) faktiska AV(_ sg def basic pos) AV(_ pl _ basic pos) förre AV(utr sg def basic pos) gott AV(neutr sg indef basic pos) AB(no) grymma AV(_ sg def basic pos) AV(_ pl _ basic pos) klart AV(neutr sg indef basic pos) AB(no) kraftig AV(utr sg indef basic pos) kraftigt AV(neutr sg indef basic pos) AB(no) kristen AV(utr sg indef basic pos) kristet AV(neutr sg indef basic pos) kvinnlig AV(utr sg indef basic pos) kvinnliga AV(_ sg def basic pos) AV(_ pl _ basic pos) långa AV(_ sg def basic pos) AV(_ pl _ basic pos) mörka AV(_ sg def basic pos) AV(_ pl _ basic pos) nya AV(_ sg def basic pos) AV(_ pl _ basic pos) nästa AV(_ sg indef basic pos) nödvändig AV(utr sg indef basic pos) nödvändigt AV(neutr sg indef basic pos) officiell AV(neutr sg indef basic pos) officiella AV(_ sg def basic pos) AV(_ pl _ basic pos) olika AV(_ sg def basic pos) AV(_ pl _ basic pos) osagt AV(neutr sg indef basic pos) personliga AV(_ sg def basic pos) AV(_ pl _ basic pos) politiskt AV(neutr sg indef basic pos) AB(no) praktiska AV(_ sg def basic pos) AV(_ pl _ basic pos) reellt AV(neutr sg indef basic pos) AB(no) riktig AV(utr sg indef basic pos) riktigt AV(neutr sg indef basic pos) AB(no) slutgiltiga AV(_ sg def basic pos) AV(_ pl _ basic pos) stor AV(utr sg indef basic pos) större AV( _ basic comp) störst AV( _ basic sup) suraste AV( _ basic sup) svenska AV(_ sg def basic pos) AV(_ pl _ basic pos) svåraste AV( _ basic sup) tidiga AV(_ sg def basic pos) AV(_ pl _ basic pos) uppkäftig AV(utr sg indef basic pos) viktig AV(utr sg indef basic pos) östtyske AV(utr sg def basic pos) övriga AV(_ sg def basic pos) AV(_ pl _ basic pos) jag PN(utr sg subj) du PN(utr sg subj) han PN(utr sg subj) hon PN(utr sg subj) den PN(utr sg [subj obj]) det PN(neutr sg [subj obj]) man PN(utr sg subj) vi PN(utr pl subj) ni PN(utr pl subj) 21
de PN(_ pl subj) mig PN(utr sg obj) dig PN(utr sg obj) honom PN(utr sg obj) henne PN(utr sg obj) sig PN(utr _ obj) oss PN(utr pl obj) er PN(utr pl obj) dem PN(utr pl obj) min PS(utr sg) mitt PS(neutr sg) mina PS(_ pl) din PS(utr sg) ditt PS(neutr sg) dina PS(_ pl) sin PS(utr sg) sitt PS(neutr sg) sina PS(_ pl) hans PS( ) hennes PS( ) vår PS(utr sg) vårt PS(neutr sg) våra PS(_ pl) er PS(utr sg) ert PS(neutr sg) era PS(_ pl) deras PS( ) varandra PE(basic) varandras PE(gen) som PV( basic) vilken PV(utr sg basic) PI(utr sg basic _) vilket PV(neutr sg basic) PI(neutr sg basic _) vilka PV(_ pl basic) PI(_ pl basic _) vad PV(_ sg basic) PI(_ sg basic _) vars PV( gen) vem PI(utr sg basic själv) vems PI(utr sg gen fören) alla PD(tot indef _ pl basic) denna PD(det2 indef utr sg basic) detta PD(det2 indef neutr sg basic) dessa PD(det2 indef _ pl basic) det_här PD(det1 def utr sg basic) den_här PD(det1 def utr sg basic) de_här PD(det1 def utr sg basic) flera PD(quant indef _ pl basic) inget PD(tot indef neutr sg basic) någon PD(_ indef utr sg basic) något PD(_ indef neutr sg basic) några PD(_ indef _ pl basic) mången PD(quant indef utr sg basic) många PD(quant indef _ pl basic) var PD(tot indef utr sg basic) vart PD(tot indef neutr sg basic) varje PD(tot indef utr sg basic) övriga PD(select indef _ pl basic) en AL(utr sg indef) ett AL(neutr sg indef) den AL(utr sg def) det AL(neutr sg def) de AL(_ pl def) 1962 NL(card pl basic) två NL(card pl basic) hundra NL(card pl basic) första NL(ord _ basic) tredje NL(ord _ basic) accepterat VB(act sup main) använda VB(act inf main) använder VB(act pres main) 22
avskedade VB(act pret main) PC(perf _ sg def basic) PC(perf _ pl _ basic) behövt VB(act sup [main mod]) betänk VB(act imp main) betänkt VB(act sup main) bevisat VB(act sup main) PC(perf neutr sg indef basic) blev VB(act pret [main cop]) bli VB(act inf [main cop]) blir VB(act pres [main cop]) brukar VB(act pres [main mod]) bytte VB(act pret main) börja VB(act inf [main mod]) börjar VB(act pres [main mod]) drabbades VB(pass pret main) existerar VB(act pres main) fann VB(act pret [main mod]) finnas VB(pass inf main) finns VB(act pres main) fortsätta VB(act inf main) framför VB(act pres main) framföra VB(act inf main) få VB(act inf main) får VB(act pres [main mod]) fästa VB(act inf main) fäste VB(act pret main) förnya VB(act inf main) försvarar VB(act pres main) handikappade VB(act pret main) PC(perf _ sg def basic) PC(perf _ pl _ basic) gynnar VB(act pres main) gäller VB(act pres main) göra VB(act inf main) ha VB(act inf [main temp]) har VB(act pres [main temp]) hörde VB(act pret main) inför VB(act imp main) instiftades VB(pass pret main) kan VB(act pres [main mod]) kana VB(act inf main) kidnappa VB(act inf main) kidnappade VB(act pret main) PC(perf _ sg def basic) PC(perf _ pl _ basic) kommer VB(act pres main) kommit VB(act sup main) kvantifiera VB(act inf main) kvantifierat VB(act sup main) lovat VB(act sup main) låt VB(act imp [main mod]) låter VB(act pres [main mod]) låtit VB(act sup [main mod]) lärt VB(act sup main) lät VB(act pret [main mod]) montera VB(act inf main) ordinera VB(act inf main) prioritera VB(act inf main) präglas VB(depon pres main) påstås VB(pass pres main) riktar VB(act pres main) rulla VB(act inf main) rullar VB(act pres main) sade VB(act pret main) saknat VB(act sup main) ser VB(act pres main) ska VB(act pres mod) skall VB(act pres mod) skrivit VB(act sup main) slimma VB(act inf main) slå VB(act inf main) starta VB(act inf main) stöder VB(act pres main) ta VB(act inf main) 23
tas VB(pass pres main) tillkommer VB(act pres main) tro VB(act inf main) tror VB(act pres main) tänds VB(pass pres main) tänka VB(act inf main) tänts VB(pass sup main) uppstod VB(act pret main) var VB(act pret [main cop]) vara VB(act inf [main cop]) varit VB(act sup [main cop]) väntade VB(act pret main) PC(perf _ sg def basic) PC(perf _ pl _ basic) väntas VB(pass pres main) är VB(act pres [main cop]) avhoppade PC(perf _ sg def basic) PC(perf _ pl _ basic) beträffande PC(pres _ basic) boende PC(pres _ basic) framväxande PC(pres _ basic) handikappande PC(pres _ basic) införd PC(perf utr sg indef basic) manglad PC(perf utr sg indef basic) skrivet PC(perf neutr sg indef basic) störande PC(pres _ basic) väntande PC(pres _ basic) ökad PC(perf utr sg indef basic) aldrig AB(yes) enbart AB(_) dock AB(yes) där AB(no) framför AB(no) i_dag AB(no) igen AB(no) in AB(no) inte AB(yes) kanske AB(_) också AB(_) mest AB(no) ned AB(no) nu AB(no) nästan AB(no) om AB(no) redan AB(_) så AB(no) väl AB(yes) av PR för PR AB(no) före PR i PR inför PR med PR på PR till PR under PR vid PR än PR men CN och CN att SN(att) IE innan SN(other) när SN(other) om SN(other) \. Punc \! Punc \? Quest \, Comma 24
Bibliography and References Ahrenberg, Lars (1994): Formell grammatik för naturligt språk. Linköping University, Department of Computer and Information Science. Linköping. Ljung, Magnus & Ohlander, Sölve (1982). Allmän grammatik. Stockholm. Loman, Bengt (1956): Om relationen mellan ordföljd och betydelse hos framförställda attributiva bestämningar till substantiviska huvudord. In: Arkiv för nordisk filologi 71, p. 218-244. Teleman, Ulf (1974): Manual för grammatisk beskrivning av talad och skriven svenska. Lund. Thorell, Olof (1973): Svensk grammatik. Stockholm. Wedbjer Rambell et al. (1998). An Error Database of Swedish. SCARRIE, Deliverable 2.1.3.2, version 1.0. Uppsala University, Department of Linguistics. Wedbjer Rambell, Olga (1998): Three Types of Grammatical Errors in Swedish. SCARRIE, Deliverable 6.2.3, version 1.0. Uppsala University, Department of Linguistics. Vosse, Theo (1994). The Word Connection. Grammar-based Spelling Error Correction in Dutch. Amsterdam.