question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Gather more data for the chatbot's database via crowd-sourcing

See original GitHub issue

Requirement The sentences.csv file has very limited data which can be used for the initial training. The aim is to gather more data via crowd-sourcing and sources to help improve the responses of the bot via ML models.

Pre-requisite

Elementary knowledge of Python Elementary understanding of the available data

Dependencies None

Description This is an open-ended issue where participants can explore crowd-sourcing to gather the data required for improving the bot’s NLP capabilities. We can either look at using a crowd-sourcing platform (like Amazon Mechanical Turks) or a simple survey form distributed amongst friends.

The primary aim with this bit would be to get a wide variety of questions that people may ask a mapbot i.e. a bot which can answer direction and location information related queries primarily. Please provide the details of the different APIs we’re planning to include in the bot and ask folks to frame their questions based on the set of available capabilites.

As discussed in a similar issue #52, elementary pre-processing of the data might be required before we put it in the db. Please look at sentences.csv to get an idea of the kind of questions we’re handling right now.

Please review your method of gathering data before actually putting it up on a site or sharing it with your friends/batchmates/colleagues

Issue Analytics

  • State:open
  • Created 4 years ago
  • Comments:13 (5 by maintainers)

github_iconTop GitHub Comments

1reaction
shreyanshi2228commented, Mar 30, 2020

Thank you 😃

0reactions
vishakha-lallcommented, May 30, 2020

@shreyanshi2228 while the link is interesting, the idea of using crowd sourcing was so we could concentrate on the intent of the bot (which is navigation related). Do you think you would be able to crowd source the data?

Read more comments on GitHub >

github_iconTop Results From Across the Web

Chatbot Evaluation and Database Expansion via Crowdsourcing
Chatbot Evaluation and Database Expansion via Crowdsourcing ... The long term goal is to create a data set of more appropriate chat responses;....
Read more >
Training Data for Chatbots - Case Study by clickworker
The principal challenge when programming chatbots is correctly recognizing the users' questions, classifying them accurately in the database and issuing the ...
Read more >
24 Best Machine Learning Datasets for Chatbot Training
Best ML datasets for chatbot training. Chatbot training datasets from multilingual dataset to dialogues and customer support chatbots.
Read more >
Chatbots: History, technology, and applications - ScienceDirect
It aims to organize critical information that is a necessary background for further research activity in the field of chatbots. More ...
Read more >
14 Best Chatbot Datasets for Machine Learning - iMerit
In order to create a more effective chatbot, one must first compile realistic, task-oriented dialog data to effectively train the chatbot.
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found