Linguistics Realization

In linguistics, realization is the process by which some kind of surface representation is derived from its underlying representation; that is, the way in which some abstract object of linguistic analysis comes to be produced in actual language.

Phonemes are often said to be realized by speech sounds. The different sounds that can realize a particular phoneme are called its allophones.

Realization is also a subtask of natural language generation, which involves creating an actual text in a human language (English, French, etc.) from a syntactic representation. There are a number of software packages available for realization, most of which have been developed by academic research groups in NLG. The remainder of this article concerns realization of this kind.

Example

For example, the following Java code causes the simplenlg system [2] to print out the text The women do not smoke.:

NPPhraseSpec subject = nlgFactory.createNounPhrase("the", "woman"); subject.setPlural(true); SPhraseSpec sentence = nlgFactory.createClause(subject, "smoke"); sentence.setFeature(Feature.NEGATED, true); System.out.println(realiser.realiseSentence(sentence)); 

In this example, the computer program has specified the linguistic constituents of the sentence (verb, subject), and also linguistic features (plural subject, negated), and from this information the realiser has constructed the actual sentence.

Processing

Realisation involves three kinds of processing:

Syntactic realisation: Using grammatical knowledge to choose inflections, add function words and also to decide the order of components. For example, in English the subject usually precedes the verb, and the negated form of smoke is do not smoke.

Morphological realisation: Computing inflected forms, for example the plural form of woman is women (not womans).

Orthographic realisation: Dealing with casing, punctuation, and formatting. For example, capitalising The because it is the first word of the sentence.

The above examples are very basic, most realisers are capable of considerably more complex processing.

Systems

A number of realisers have been developed over the past 20 years. These systems differ in terms of complexity and sophistication of their processing, robustness in dealing with unusual cases, and whether they are accessed programmatically via an API or whether they take a textual representation of a syntactic structure as their input.

There are also major differences in pragmatic factors such as documentation, support, licensing terms, speed and memory usage, etc.

It is not possible to describe all realisers here, but a few of the emerging areas are:

  • Simplenlg [3]: a document realizing engine with an api which intended to be simple to learn and use, focused on limiting scope to only finding the surface area of a document.
  • KPML [4]: this is the oldest realiser, which has been under development under different guises since the 1980s. It comes with grammars for ten different languages.
  • FUF/SURGE [5]: a realiser which was widely used in the 1990s, and is still used in some projects today
  • OpenCCG [6]: an open-source realiser which has a number of nice features, such as the ability to use statistical language models to make realisation decisions.

References

  • [7] - ACL NLG Portal (contains links to the above and many other realisers)

Tags:

Linguistics Realization ExampleLinguistics Realization ProcessingLinguistics Realization SystemsLinguistics RealizationAbstractionAllophoneLinguisticsPhonemeSpeech soundSurface representation

🔥 Trending searches on Wiki English:

Kaya ScodelarioOppenheimer (film)Ariana GrandeTeri Baaton Mein Aisa Uljha JiyaList of countries and dependencies by population2003 Angola Boeing 727 disappearanceMister SinisterMothers' Instinct (2024 film)ThailandHiroyuki SanadaIslamic State – Khorasan ProvinceXVideosJohn Wilkes BoothSofía VergaraThe Pirate BayTasman Bridge disasterPhilippe TroussierJennifer LopezBen AffleckWikiErnie HudsonBramayugamNetflixJason MomoaAnna SawaiTruman CapoteNXT Stand & Deliver (2024)Josh DoanSolar eclipse of April 8, 20242024 Miami Open – Women's singlesMatthew McConaugheyMichael SchumacherBattlegrounds Mobile IndiaLiverpool F.C.Forced perspectiveThe Notorious B.I.G.Henry CavillMonk (TV series)Rachin RavindraMain PageSolar eclipseOkujepisa omukazenduGeneration ZThe Three-Body Problem (novel)War (card game)William ShakespeareVanessa HudgensList of solar eclipses visible from the United StatesPrince Harry, Duke of SussexUnited States Supreme Court BuildingImmaculate (2024 film)Cheryl HinesNarendra ModiTed KaczynskiXXXXPremier LeagueJoe KeeryVideoToni KroosFranklin D. RooseveltDraft lottery (1969)Christian DiorPreity ZintaSelena GomezWikipediaJustin TimberlakeList of James Bond filmsList of highest-grossing filmsA Serbian FilmSoaking (sexual practice)Mumbai IndiansGeorgia (country)Drag Me to HellGoogle Maps2020 United States presidential electionRoman EmpireDiana, Princess of WalesFascism🡆 More