Dataset with four years of condition monitoring technical language annotations from paper machine industries in northern Sweden ... : Dataset med annoteringar av tekniskt språk från fyra års tillståndsövervakning av pappersmaskinsindustri i norra Sverige ...

This dataset consists of four years of technical language annotations from two paper machines in northern Sweden, structured as a Pandas dataframe. The same data is also available as a semicolon-separated .csv file. The data consists of two columns, where the first column corresponds to annotation n...

Full description

Bibliographic Details
Main Authors:	Löwenmark, Karl, Sandin, Fredrik, Liwicki, Marcus, Schnabel, Stephan
Format:	Dataset
Language:	Swedish
Published:	Luleå University of Technology 2023
Subjects:	paper machines pappersmaskiner condition monitoring övervakning av skick language technology språkteknologi signal processing signalbehandling paper industry pappersindustri fault detection natural language processing machines maskiner sciences branches of science vetenskapsgrenar supervision övervakning pulp and paper industry massa- och pappersindustri treatment and handling behandling och hantering detection process industry processindustri action with an object aktivitet med objekt forest industry skogsindustri systems of a society samhälleliga system technical objects physical objects tekniska objekt fysiska inanimate objects icke-levande objekt action verksamhet industry industri systems system basic industry basindustri physical objects fysiska objekt events and action händelser och handling objects entiteter Technical Language Processing Norra Sverige Northern Sweden
Online Access:	https://dx.doi.org/10.5878/hafd-ms27 https://snd.se/catalogue/dataset/2023-257/1

Description
Summary:	This dataset consists of four years of technical language annotations from two paper machines in northern Sweden, structured as a Pandas dataframe. The same data is also available as a semicolon-separated .csv file. The data consists of two columns, where the first column corresponds to annotation note contents, and the second column corresponds to annotation titles. The annotations are in Swedish, and processed so that all mentions of personal information are replaced with the string ‘egennamn’, meaning “personal name” in Swedish. Each row corresponds to one annotation with the corresponding title. Data can be accessed in Python with: import pandas as pd annotations_df = pd.read_pickle("Technical_Language_Annotations.pkl") annotation_contents = annotations_df['noteComment'] annotation_titles = annotations_df['title'] ... : Detta dataset består av tekniskt-språk-annoteringar från fyra års insamling från två pappersmaskiner i norra Sverige, strukturerat som en Pandas dataframe. Samma data finns också tillgänglig som en semikolonseparerad .csv-fil. Datan består av två kolumner, där den första kolumnen motsvarar annoteringens textinnehåll, och den andra titeln. Annoteringarna är skrivna på svenska, och processade så att alla egennamn ersatts av textsträngen ’egennamn’. Varje rad motsvarar en annotering med titel. Data behandlas i Python med: import pandas as pd annotations_df = pd.read_pickle("Technical_Language_Annotations.pkl") annotation_contents = annotations_df['noteComment'] annotation_titles = annotations_df['title'] ...

Dataset with four years of condition monitoring technical language annotations from paper machine industries in northern Sweden ... : Dataset med annoteringar av tekniskt språk från fyra års tillståndsövervakning av pappersmaskinsindustri i norra Sverige ...

Similar Items