Truth-O-Meter: Making neural content meaningful and truthful

Thursday, April 13, 2023

Text produced by a deep learning (DL) content generation system (raw text) often suffers from randomness and factual incorrectness. We build a content improvement system aimed specifically at repairing these errors: it finds correct and consistent sentences in various sources and substitutes them for problematic entities, phrases, and sentences in the raw content. We use text mining to identify the correct corresponding sentences, together with a syntactic and semantic generalization procedure adapted to the content improvement task. We observed that raw content produced by a DL system like GPT-3 can be substantially improved in factual correctness and meaningfulness.

 


Speaker

Boris Galitsky has contributed linguistic and machine learning technologies to Silicon Valley startups, as well as companies like eBay and Oracle, for over 25 years. Boris’s information extraction and sentiment analysis techniques contributed to a number of acquisitions, such as Xoopit by Yahoo, Uptake by Groupon, LogLogic by TIBCO, and Zvents by eBay. His security-related document analysis technologies contributed to the acquisition of Elastica by Symantec: https://github.com/bgalitsky/relevance-based-on-parse-trees. As an architect of the Intelligent Bots project at Oracle, Boris developed a discourse analysis technique used for dialogue management and published in the book “Developing Enterprise Chatbots”. He also published a two-volume monograph, “AI for CRM”, based on his experience developing Oracle Digital Assistant. Boris is an Apache committer to OpenNLP, where he created the OpenNLP.Similarity component, which is a basis for semantically enriched search engine and chatbot development. Galitsky’s exploration and formalization of human reasoning culminated in the book “Computational Autism”, widely used by parents of children with autistic reasoning and by rehabilitation personnel. Boris’s focus on the medical domain led to another research monograph, “AI for Health Applications and Management”. https://www.amazon.com/Books-Boris-Galitsky/s?rh=n%3A283155%2Cp_27%3ABoris+Galitsky The author of 150+ publications, 50+ patents, and 6 books, Boris now focuses on improving content generation quality.
