ToD System Human Evaluation Tool

This directory is dedicated to human evaluations of task-oriented dialogue systems. This toolkit provides an integrated framework for evaluating the performance and user satisfaction of such systems.

This tool aims to streamline the process of human evaluation for dialogue systems, providing a platform for researchers and developers to evaluate the performance of their systems in real-world conversational scenarios.

Repository Overview

The repository is organised into two primary components:

client: This module houses the frontend web interface of the human evaluation tool. It is developed as a React Application.
server: This module contains the backend server code necessary for the operation of the human evaluation tool. It manages data processing, storage, and communication between the frontend interface and the dialogue systems being evaluated.

Getting Started

Our tool is designed with an out-of-the-box capability, facilitated by full containerisation using docker and docker-compose. Try it following this instruction.

Before deployment, the web interface configuration file is located at human_eval_tool/client/src/configs.js. This file includes various settings, such as contact information, which can be customised as required.

Development Setup

To facilitate development and circumvent CORS issues, we use a Dockerised nginx server. The setup process involves:

Environment Configuration:
- Copy and modify the .env file from the deployment folder to both client and server directories.
- Follow the instructions provided in the Command line deployment and development deployment section.
Nginx Deployment:
- In the deployment folder, deploy nginx with:
```
bash deploy.sh --env=dev --component=tools --name {MODEL_NAME}
```
- Replace {MODEL_NAME} with your chosen model name.
Server Setup:
- Modify MODEL_NAME in command_line.sh to your model name.
- Run bash command_line.sh to start the server on port 5000 or execute the commands in the script individually in a terminal.
Client Development Setup:
- Inside the client folder, execute:
```
npm install
npm start
```

The web page and server should now be accessible at http://127.0.0.1:80 for testing.

Extending the Tool

The tool is developed with customisation in mind. If you are looking to tailor this tool to fit your specific needs, such as dialogue evaluation or data collection, consider exploring the following resources:

JavaScript and TypeScript:
- Learn the basics of JavaScript and TypeScript, the languages of our frontend.
ReactJS:
- Have a look at ReactJS.
Ant Design:
- The web interface can be easily extend with any component from the Ant Design toolkit.
Backend Development:
- To accommodate new functionalities, the Flask server's RESTful API may need modifications to handle additional requests from the frontend.

By leveraging these resources, customising the tool is a simple and manageable task. ☺️

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

ToD System Human Evaluation Tool

Repository Overview

Getting Started

Development Setup

Extending the Tool

Files

README.md

Latest commit

History

README.md

File metadata and controls

ToD System Human Evaluation Tool

Repository Overview

Getting Started

Development Setup

Extending the Tool