CLARIN-Servicezentrum des Zentrums Sprache an der BBAW
CLARIN-Servicezentrum des Zentrums Sprache an der BBAW

Tokenizer and Sentence Splitter

Header

URL:https://hdl.handle.net/21.11120/0000-0005-A208-B
CMDI profile:clarin.eu:cr1:p_1320657629644
Name of collection:WebLicht Webservice Orchestrator

Resources

Service description

About the service

Name:Tokenizer and Sentence Splitter
Description:detects word- and sentence boundaries in raw text using WASTE (http://www.dwds.de/waste/)
Application type:webService
Type of webservice:RESTfull
URLhttp://kaskade.dwds.de/waste/tokenize.fcgi?mode=tcf
Life cycle status:production
Publication date:2010-12-19T23:10:10+01:00
Last update:2014-12-09T11:05:12+01:00
Contact:jurish@bbaw.de
Creator:Berlin-Brandenburg Academy of Sciences and Humanities

Service Operations

Name:Default
Input parameters:
nameallow manual selection fallbackvalues
langfalsede
textfalse
typefalsetext/tcf+xml
versionfalse0.4
Output replaces input:false
Output parameters:
namevalues
sentences
tokens
XSL transformation, contact, imprint, privacy policy, 2023