Unsupervised learning of morphology for English and Inuktitut

We describe a simple unsupervised technique for learning morphology by identifying hubs in an automaton. For our purposes, a hub is a node in a graph with in-degree greater than one and out-degree greater than one. We cre-ate a word-trie, transform it into a minimal DFA, then identify hubs. Those hu...

Full description

Bibliographic Details
Main Authors: Howard Johnson, Joel Martin
Other Authors: The Pennsylvania State University CiteSeerX Archives
Format: Text
Language:English
Subjects:
Online Access:http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.451.9886
http://clair.si.umich.edu/clair/HLT-NAACL03/shorts/pdf/hlt_naacl_03_shortpaper_314.pdf
id ftciteseerx:oai:CiteSeerX.psu:10.1.1.451.9886
record_format openpolar
spelling ftciteseerx:oai:CiteSeerX.psu:10.1.1.451.9886 2023-05-15T16:55:35+02:00 Unsupervised learning of morphology for English and Inuktitut Howard Johnson Joel Martin The Pennsylvania State University CiteSeerX Archives application/pdf http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.451.9886 http://clair.si.umich.edu/clair/HLT-NAACL03/shorts/pdf/hlt_naacl_03_shortpaper_314.pdf en eng http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.451.9886 http://clair.si.umich.edu/clair/HLT-NAACL03/shorts/pdf/hlt_naacl_03_shortpaper_314.pdf Metadata may be used without restrictions as long as the oai identifier remains attached to it. http://clair.si.umich.edu/clair/HLT-NAACL03/shorts/pdf/hlt_naacl_03_shortpaper_314.pdf text ftciteseerx 2016-01-08T05:56:15Z We describe a simple unsupervised technique for learning morphology by identifying hubs in an automaton. For our purposes, a hub is a node in a graph with in-degree greater than one and out-degree greater than one. We cre-ate a word-trie, transform it into a minimal DFA, then identify hubs. Those hubs mark the boundary between root and suffix, achieving similar performance to more com-plex mixtures of techniques. 1 Text inuktitut Unknown
institution Open Polar
collection Unknown
op_collection_id ftciteseerx
language English
description We describe a simple unsupervised technique for learning morphology by identifying hubs in an automaton. For our purposes, a hub is a node in a graph with in-degree greater than one and out-degree greater than one. We cre-ate a word-trie, transform it into a minimal DFA, then identify hubs. Those hubs mark the boundary between root and suffix, achieving similar performance to more com-plex mixtures of techniques. 1
author2 The Pennsylvania State University CiteSeerX Archives
format Text
author Howard Johnson
Joel Martin
spellingShingle Howard Johnson
Joel Martin
Unsupervised learning of morphology for English and Inuktitut
author_facet Howard Johnson
Joel Martin
author_sort Howard Johnson
title Unsupervised learning of morphology for English and Inuktitut
title_short Unsupervised learning of morphology for English and Inuktitut
title_full Unsupervised learning of morphology for English and Inuktitut
title_fullStr Unsupervised learning of morphology for English and Inuktitut
title_full_unstemmed Unsupervised learning of morphology for English and Inuktitut
title_sort unsupervised learning of morphology for english and inuktitut
url http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.451.9886
http://clair.si.umich.edu/clair/HLT-NAACL03/shorts/pdf/hlt_naacl_03_shortpaper_314.pdf
genre inuktitut
genre_facet inuktitut
op_source http://clair.si.umich.edu/clair/HLT-NAACL03/shorts/pdf/hlt_naacl_03_shortpaper_314.pdf
op_relation http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.451.9886
http://clair.si.umich.edu/clair/HLT-NAACL03/shorts/pdf/hlt_naacl_03_shortpaper_314.pdf
op_rights Metadata may be used without restrictions as long as the oai identifier remains attached to it.
_version_ 1766046580509507584