A Declarative Query Language for Scientific Machine Learning

The popularity of data science as a discipline and its importance in the emerging economy and industrial progress dictate that machine learning be democratized for the masses. This also means that the current practice of workforce training using machine learning tools, which requires low-level statistical and algorithmic details, is a barrier that needs to be addressed. Similar to data management languages such as SQL, machine learning needs to be practiced at a conceptual level to help make it a staple tool for general users. In particular, the technical sophistication demanded by existing machine learning frameworks is prohibitive for many scientists who are not computationally savvy or well versed in machine learning techniques. The learning curve to use the needed machine learning tools is also too high for them to take advantage of these powerful platforms to rapidly advance science. In this paper, we introduce a new declarative machine learning query language, called {\em MQL}, for naive users. We discuss its merit and possible ways of implementing it over a traditional relational database system. We discuss two materials science experiments implemented using MQL on a materials science workflow system called MatFlow.

翻译：数据科学作为一门学科的普及及其在新兴经济和工业进步中的重要性，决定了机器学习必须向大众普及。这也意味着当前使用机器学习工具进行劳动力培训的实践——这需要低层次的统计和算法细节——是一个需要解决的障碍。类似于SQL等数据管理语言，机器学习需要在概念层面进行实践，以帮助其成为普通用户的主流工具。特别是，现有机器学习框架所要求的技术复杂性，对于许多不精通计算或不熟悉机器学习技术的科学家来说是难以逾越的。使用所需机器学习工具的学习曲线也过高，使他们无法利用这些强大平台快速推进科学进展。在本文中，我们为新手用户介绍了一种新的声明式机器学习查询语言，称为{\em MQL}。我们讨论了它的优点以及在传统关系数据库系统上实现它的可能方式。我们还讨论了使用MQL在名为MatFlow的材料科学工作流系统上实现的两个材料科学实验。

相关内容

Machine Learning

关注 2251

机器学习（Machine Learning）是一个研究计算学习方法的国际论坛。该杂志发表文章，报告广泛的学习方法应用于各种学习问题的实质性结果。该杂志的特色论文描述研究的问题和方法，应用研究和研究方法的问题。有关学习问题或方法的论文通过实证研究、理论分析或与心理现象的比较提供了坚实的支持。应用论文展示了如何应用学习方法来解决重要的应用问题。研究方法论文改进了机器学习的研究方法。所有的论文都以其他研究人员可以验证或复制的方式描述了支持证据。论文还详细说明了学习的组成部分，并讨论了关于知识表示和性能任务的假设。官网地址：http://dblp.uni-trier.de/db/journals/ml/

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日