Dictate to text (voice recording)

  1. Create a project
  2. Add a task pool
  3. Upload tasks
  4. Start the pool and get the results

Run the project in the Sandbox first. This helps you catch mistakes before you spend money on a task that doesn't work correctly.

Voice recording tasks are easiest to complete in the mobile app for Android or iOS. The mobile apps can record audio directly in a task using the device's built-in voice recorder.

You may need additional projects for your task, such as dataset pre-check or checking performers' responses. Learn more about this in the Designing the solution architecture section.

Let's say you need to collect audio recordings in which users read your text out loud. To do this, create a task in which the performer is given a text phrase and attaches an audio recording as a response.

Example of a prepared task

To run tasks and get responses:

Create a project

The project defines what the task will look like for a performer.

  1. Click the + Create project button and choose the empty template at the bottom of the page.

  2. Enter a clear name and a short description for the project. Performers will see this in the task list.

  3. Write short and clear guidelines (see the recommendations).
    Note. This tutorial shows how to create a task interface in Yandex.Toloka. You can also try creating a task interface in the Template Builder.
  4. Define which objects you are going to pass to the performers and receive from them in response. To do this, add input and output fields in the Specifications block.
    What are input and output data?

    Input data are the objects passed to the performer for completing the task. For example, this could be a text, an image, or geographic coordinates.

    Output data are the objects that you receive after the task is completed. For example, this could be one of several response options, typed text, or an uploaded file.

    Learn more about input and output data fields.

    In this case they are:

    • Input data field — The phrase string with the text to be voiced by the performer.
    • Output data field — The audio_record file, an audio recording that the performer should upload.
  5. Create the task interface in the HTML and CSS blocks. It describes how the task elements should be arranged in the task.

    You can use standard HTML tags and special expressions in double curly brackets for input and output data fields.

    <div class="text">
      {{phrase}}
    </div>
    <div class="record">
      {{field type="file" sources="RECORDER" fileType="AUDIO" name="audio_record" label="Open the voice recorder"}}
    </div>
    This notation describes the following task design:
    • Text from the phrase input field.
    • Button to start the audio recorder and record a file saved in the audio_record field.

    Leave the JavaScript block unchanged.

    Add styles to the CSS block for the correct display on mobile devices. Example for the simplest case:
    .task {
      display: block;
      margin: 0;
      margin-bottom: 20px;
      padding: 20px;
    }
    .text {
      font-size: 18px;
      font-weight: bold;
      line-height: 23px;
    }
    .record {
      margin-top: 5vh;
    }
  6. Click the Preview button to view the task. Reduce the viewport size using the browser's developer tools to make sure that the task displays correctly on mobile devices.
    Note. The project preview shows one task with standard data. You can define the number of tasks to show on the page later.
  7. Save the project.
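
The field specification from step 4 can be pictured as plain data. The sketch below is a hypothetical Python representation, not the platform's confirmed format: the dictionary layout, the `required` flag, and the `missing_fields` helper are assumptions made for illustration. Only the field names `phrase` and `audio_record` and their types come from the steps above.

```python
# Illustrative only: this dict layout is an assumption, not a confirmed
# platform format. Field names and types follow the tutorial.
INPUT_SPEC = {"phrase": {"type": "string", "required": True}}
OUTPUT_SPEC = {"audio_record": {"type": "file", "required": True}}


def missing_fields(spec: dict, values: dict) -> list[str]:
    """Return required field names from spec that are absent or empty in values."""
    return [
        name
        for name, rules in spec.items()
        if rules.get("required") and not values.get(name)
    ]


task_input = {"phrase": "The quick brown fox jumps over the lazy dog."}
print(missing_fields(INPUT_SPEC, task_input))  # []
print(missing_fields(OUTPUT_SPEC, {}))         # ['audio_record']
```

A check like this can be run over your data before upload to spot rows with an empty phrase.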

Add a task pool

A pool is a set of paid tasks sent out for completion at the same time.

  1. Open the project and click Add pool.
  2. Give the pool any convenient name and description. The pool info is only available to you. Performers can view only the project name and description.
  3. Set the price per task page (for instance, $0.02).
    What is a task page?

    A page can contain one or several tasks. If the tasks are simple, you can add 10-20 tasks per page. Don't make pages too long, because long pages load slowly for performers.

    Performers get paid for completing the whole page.

    The number of tasks on the page is set when uploading tasks.

    What is the fair price for a task page?

    As a general rule, the more time a performer needs to complete the task, the higher the price should be.

    You can register in Yandex.Toloka as a performer and find out how much other requesters pay for tasks, or see examples of cost for different types of tasks.

  4. Set the Time allowed for completing a task page. It should be long enough to read the guidelines and wait for task data to download (for example, 1200 seconds).
  5. Set the Overlap, which is the number of performers who complete the same task. The value depends on how many recordings of the same phrase you want to collect. If one is enough, enter 1.
  6. If there is no adult content in the task in any form, turn off Adult content.
  7. Enable the Non-automatic acceptance option and enter the number of days for checking in the Review period field (for example, 7).
  8. Add Filters to choose performers.
  9. Save the pool.
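
The price per page, the overlap, and the number of tasks per page together determine how much the pool pays out to performers. The following is a rough sketch of that arithmetic, assuming payouts are simply pages × overlap × price. The `pool_budget` function is hypothetical, and platform fees are not included because this tutorial does not specify them.

```python
import math
from decimal import Decimal


def pool_budget(num_tasks: int, tasks_per_page: int, overlap: int,
                price_per_page: Decimal) -> Decimal:
    """Estimate performer payouts for a pool (platform fees not included)."""
    # A partially filled last page still counts as a page.
    pages = math.ceil(num_tasks / tasks_per_page)
    return pages * overlap * price_per_page


# 100 phrases, 5 per page, one recording per phrase, $0.02 per page.
print(pool_budget(100, 5, 1, Decimal("0.02")))  # 0.40
```

`Decimal` is used instead of `float` so that small prices such as $0.02 add up without rounding noise.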

Upload tasks

Prepare your own task file. Check out the example in a demo TSV file. You can find it on the pool page. At the top-left of the page, there are links to TSV files with regular, control, and training tasks.
  1. Click Upload. In the window that opens, you can also download a sample TSV file by clicking Sample file for uploading tasks.
    What is TSV?
    A TSV file presents a table as a text file in which columns are separated by tabs.
    You can work with it both in a table editor and a text editor, and then save it to the desired format. Learn more about working with a TSV file. There is a CSV format that is similar to TSV, but you should use a TSV file for uploading.
    Note. Before uploading the file, make sure it is saved in UTF-8 encoding.
  2. Add the input data to the file. The header of each input data column starts with the word INPUT. Put the text to be read out loud in the INPUT:phrase column and remove the other columns.
  3. Load the tasks. Choose Set manually and set the number of tasks per page (for example, 5). This means that there are 5 phrases per page and the performer has to attach 5 audio files.
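
If your phrases are already in a program or a database, the task file can be generated rather than edited by hand. Below is a minimal sketch that writes a UTF-8, tab-separated file with a single `INPUT:phrase` column. The function name, the output file name `tasks.tsv`, and the sample phrases are illustrative assumptions.

```python
import csv
from pathlib import Path


def write_tasks_tsv(path: Path, phrases: list[str]) -> None:
    """Write one task per row under a single INPUT:phrase column, in UTF-8."""
    with path.open("w", encoding="utf-8", newline="") as f:
        writer = csv.writer(f, delimiter="\t", lineterminator="\n")
        writer.writerow(["INPUT:phrase"])  # header row names the input field
        for phrase in phrases:
            writer.writerow([phrase])


# Sample phrases and file name are placeholders.
phrases = ["Turn on the lights.", "What is the weather today?"]
write_tasks_tsv(Path("tasks.tsv"), phrases)
```

Note that the `csv` module wraps a phrase in quotes if it contains a tab, a line break, or a double quote. It is safest to keep phrases free of those characters and to compare the result with the sample file from the pool page.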

Start the pool and get the results

  1. Start the pool by clicking the start button.
  2. Track the completion in the Pool statistics block.
  3. Start the review as soon as you get the first results. When the review period ends, all responses that are still unchecked are accepted automatically, regardless of their quality.

    To check the tasks and download the attached files, open the pool and click Download results, and then Download attachments.
    Note. The files received from the Yandex.Toloka app are in WAV format (16 kHz, 16-bit PCM).
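
Part of the review can be automated by checking that each downloaded attachment has the format mentioned in the note. Here is a small sketch using Python's standard `wave` module. The `is_expected_wav` helper and the file path in the comment are hypothetical, and the check covers only the sample rate and sample width, not the content of the recording.

```python
import wave


def is_expected_wav(path: str, rate: int = 16000,
                    sample_width_bytes: int = 2) -> bool:
    """Return True if the file is a PCM WAV with the given rate and sample width."""
    try:
        with wave.open(path, "rb") as wav:
            return (
                wav.getframerate() == rate
                and wav.getsampwidth() == sample_width_bytes  # 2 bytes = 16 bit
            )
    except wave.Error:
        # Raised for files that are not PCM WAV (for example, a renamed MP3).
        return False


# Example call (hypothetical path):
# is_expected_wav("attachments/recording.wav")
```

Files uploaded from a browser rather than recorded in the app may legitimately use other formats, so treat a failed check as a prompt for manual review rather than an automatic rejection.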