AnacondaCon 2018. David Sullivan. "DUKE: Dataset Understanding through Knowledge-base Embeddings" produces abstractive descriptions of datasets based on word2vec model trained on wikipedia paired with a curated ontology. For those familiar with word2vec, you can think of DUKE as essentially "dataset2vec". This talk will discuss the technology behind DUKE, how DUKE can be used to improve the data science and data engineering process, and how the audience can download and use the software.
I would like to work with open source projects to create a branch of the tree with all
of the best videos for your open source project. Please
send me an email if you are interested.