MagicData
SIGN IN

Total Size: 2.51KB

Dataset Overview

Dataset Type

Text corpus for NLP

Language

zh-CN

Speech Style

N/A

Content

Automobile cabin commands and control

Audio Parameters

N/A

File Format

TXT (UTF8)

Recording Equipment

N/A

Recording Environment

N/A
Open Source
NLP Corpus
500 sentences

NLP-CAutoCabCC: A Chinese Automobile Cabin Command Corpus

This dataset consists of 500 pieces of commands and control expressions in the Chinese language in an automobile cabin setting. 10 cabin control functions are covered, and 10-100 semantically generalized commands and control expressions are correlated to each of the functions.

More specifically, functions in a cabin setting include engine and WIFI initiation, curtain adjustment, etc. And the generalized expressions aim to show diversity in syntax — verbs, entity words, and adverbs are flexibly combined. Expressions of the cabin function points in diverse forms are included as well, i.e., electronic stability program=ESP.

To enable efficient text processing, a slot is reserved, i.e., position, fraction, percentage, degree, mode, number, etc. So developers can generate more expressions according to their specific needs, such as position=[front, rear, driver, pilot, rear left, rear right, left, all].

Recommended application scenario: automobile cabin voice assistant.

Sample:

功能泛化
打开发动机自动启停开始自动启停
打开发动机自动启停开一下发动机自动启停
打开发动机自动启停开一下发动机自动启停系统
打开发动机自动启停开一下自动启停
打开发动机自动启停开自动启停
打开发动机自动启停弄开发动机自动启停
打开发动机自动启停弄开发动机自动启停系统

Dataset Overview

Dataset Type

Text corpus for NLP

Language

zh-CN

Speech Style

N/A

Content

Automobile cabin commands and control

Audio Parameters

N/A

File Format

TXT (UTF8)

Recording Equipment

N/A

Recording Environment

N/A
{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Verifying Email