人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of...

53
杨骏韬, 张雪萌, 2018117人工智能在科学计算中的应用调研

Upload: others

Post on 12-Sep-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

杨骏韬, 张雪萌, 2018年11月7日

人工智能在科学计算中的应用调研

Page 2: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

2

议程

• 团队介绍

• 计算力学

• 地球科学

• 计算物理/计算化学

• 生命科学

• 总结

Page 3: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

3

团队介绍

NVIDIA AI Technology Center (NVAITC) carries out both core and applied research for various domains

It started from Singapore with funding from Singapore government and now expanding to many more countries and regions including Australia, India, European countries like Luxemburg and growing.

One core project: Application of Deep Learning to traditional HPC

We wrote a survey paper on the current state-of-art for this domain and will be presenting to you interesting applications of ML/DL in computational sciences today.

Page 4: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

4

领域分类

计算力学 地球科学 生命科学 计算物理 计算化学

Computational

Fluid Mechanics

Climate Modeling Genomics Particle Science Quantum

Chemistry

Computational

Solid Mechanics

Weather Modeling Proteomics Astrophysics Molecular

Dynamics

Ocean Modeling

Seismic

Interpretation

Page 5: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

5

计算力学

Page 6: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

6

CLASSIFICATION OF MACHINE LEARNING FRAMEWORKS FOR COMPUTATIONAL MECHANICS

• Closure models are independently built upon data and then implemented in conservation equations

Physics-separated ML

• Framework based on prior knowledge and data become references to inform simulation to achieve data-model consistency

Physics-evaluated ML

• ML-based closure models are embedded and trained in conservation equations

Physics-integrated ML

• Recovering form of governing equations from data

Physics-recovered ML

• End-to-end ML that ultimately relies on learning algorithms to figure out hidden physics given considerable amount of data

Physics-discovered ML

Classification of Machine Learning Frameworks for Data-Driven Thermal Fluid Models, Chih-Wei Chang, Nam T. Dinh, North Carolina State University

Page 7: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

7

CLASSIFICATION OF MACHINE LEARNING FRAMEWORKS FOR COMPUTATIONAL MECHANICS

Classification of Machine Learning Frameworks for Data-Driven Thermal Fluid Models, Chih-Wei Chang, Nam T. Dinh, North Carolina State University

Classification Criteria Type I Type II Type III Type IV Type V

Is PDE involved Yes Yes Yes Yes No

Is the form of PDEs given Yes Yes Yes No No

Is the PDE involved in training No No Yes No No

Is a scale separation assumption required Yes No No No No

Page 8: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

8

EULERIAN FLUID SIMULATION WITH NEURAL-NETWORK

AI for Fluid Mechanics

Accelerating Eulerian Fluid Simulation with Neuro-Networks

Acceleration of traditional Eulerian Fluid Simulation with Neuro-Network has been attempted by some researchers

The most computing costly pressure projection step is replaced with trained neuron-network

Convolutional Network has been tested and shown positive acceleration within reasonable error in the most recent publications.

Data Driven projection method in fluid simulation [Cheng Yang et al. 2016]

Accelerating Fluid Simulation with Convolutional Network [Tompson et al. ICML 2017]

Page 9: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

9

TURBULENCE MODELING WITH MACHINE LEARNING TECHNIQUES

AI for Fluid Mechanics

RANS method couple with machine learning techniques has been new frontier for turbulence modeling

The idea is to use machine learning techniques to learn from data generated by computational expensive DNS and add the term into RANS model to improve the accuracy of turbulence modeling

RANS results are used as import and DNS results are used as label to update the model.

Tensor Basis Neural Network(TBNN) is proposed by Julia Ling and et al. (J.Fluid

Mech 2016)

Inverse Modeling Framework propsed by Universrity Michigan from “Machine Learning Methods for Data-driven turbulence modeling”, Zhang and Duraisamy (2015)

Page 10: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

10

• Rudy, et al. Create a large internal library from data and derivatives and a sparse regression method capable of discovering the governing partial differential equations of a given system by time series measurements in the spatial domain.

PDE-FIND: DATA-DRIVEN DISCOVERY OF PARTIAL DIFFERENTIAL EQUATIONS

AI for Fluid Mechanics

Page 11: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

11

CONVOLUTIONAL NEURAL NETWORKS FOR STEADY FLOW APPROXIMATION

AI for Fluid Mechanics

A quick general CNN-based approximation model for predicting the velocity field of non-uniform steady laminar flow by Guo, et al. (2016)

CNN-based approximation model trained by BLM simulation results

SFD data is used as import and error is used as lost function to train the convolutional neural networks.

82 seconds on a single core CPU to 7 milliseconds by leveraging both CNN and GPU at the cost of a low 1.98% to 2.69% error rate

CNN based CFD surrogate model architectureResults comparison between LBM model and CNN based surrogate model

Page 12: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

12

INTERACTIVE FLUID SIMULATION WITH REGRESSION FOREST

AI for Fluid Mechanics

Fluid Simulation with Trained Regression Forest [Ladicky et al, 2015]

Regression Forest model trained with data generated with SPH method

Realtime simulation generated by trained regression forest with GPU acceleration

Data driven fluid simulation using regression forests

Page 13: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

13

Mantaflow is an open-source framework targeted at fluid simulation research. It allows coupling and import/export with deep learning frameworks. It is currently being developed and maintained by Technical University of Munich. Below are some publications used manta flow for research on deep learning coupled fluid simulation reserach

Data-driven Synthesis of Smoke Flows with CNN-based Feature Descriptor

Accelerating Fluid Simulation with Convolutional Network

Mantaflow framework for fluid simulation

Framework from data-driven synthesis of smoke flows with CNN-based feature descriptor [Chen, et al, 2016]

MANTAFLOW – SIMULATION PACKAGE WORK WITH DEEP LEARNING

AI for Fluid Mechanics

Page 14: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

14

FEA SIMULATION DATA TRAINED NEURAL NETWORK

AI for Fluid Mechanics

FEA trained deep neural network for surrogate modelling of estimated stress distribution. Deepvirtuality, a spinoff from Volkswagen Data:Lab under NVIDIA Inception Program has demonstrate with their software aimed for a quicker prediction of structural data.

An demonstration of Structure Born Noise of a V12 Engine with Deepvirtuality Torsional Frequencies of a Car Body by Deepvirtuallity

Page 15: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

15

FEA UPDATED WITH NEURAL NETWORK IN BIO-TISSUE

AI for Fluid Mechanics

FEA trained deep neural network for surrogate modelling of estimated stress distribution. Traditional machine learning method has been used before, now deep learning techniques has been attempted for such model.

FEA generated stress distribution data is feed into neural network to train the neural network for fast stress distribution estimation. (Liang et al, 2018)

Ensembled decision tree model has also been applied for FEA update in “Machine Learning for modeling the biomechanical behavior of human soft tissue”. Data driven simulation has been done on Liver and Breast tissue. (Martin-Guerrereo, 2016)

Neural Network used for stress mapping by Liang Liang et al (2018)

FEA model for Liver from “Machine Learning for modelling the biomechanical behavior of human soft tissue” (Martin-Guerrereo, 2016)

Page 16: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

16

地球科学

Page 17: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

17

FEA UPDATED WITH NEURAL NETWORK IN BIO-TISSUE

AI for Earth Science

Identifying “extreme” weather events in multi-decadal datasets with 5-layered Convolutional Neural Network. Reaching 99.98% of detection accuracy. (Kim et al, 2017)

Systemic framework for detection and localization of extreme climate event

Dataset: Visualization of historic cyclones from JWTC hurricane report from 1979 to 2016

Page 18: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

18

EMULATING RRTMG WITH DEEP NEURAL NETWORKS FOR THE ENERGY EXASCALE EARTH SYSTEM MODEL

Rapid Radiation Transfer Model for GCMs(RRTMG) is the most time consuming component of General Circulation Models(GCMs).

Oak Ridge National Laboratory made use of Deep Neural Network to learn from RRTMG model.

GCM for climate modeling

Short Wave Test Results

Long Wave Test Results

AI for Earth Science

Page 19: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

19

PREDICTING PRECIPITATION FROM RADAR DATA

AI for Earth Science

Dataset: Visualization of historic cyclones from JWTC hurricane report from 1979 to 2016

• Uses conv LSTM nodes to take 101x101x4 input data from weather radar

• Trained with 2 years of data

• Final result shows RSME of 11.31%, lower than results with linear regression or FCN

Conv-LSTM Architecture

A few more architects are newly proposed to deal with such tempo-spatial prediction problem

- ConvGRU/ConvLSTM/PredRNN and many more variation of RNN models

Page 20: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

20

GAE+RNN FOR IMPROVED SPATIOTEMPORAL DATA

Multiple climate sets covering different places and time: Combining them is a huge challenge (Seo et al, 2017)

New network handles both spatial and temporal properties together to solve this problem.

GAE-RNN model architecture Forecasting of temperature

AI for Earth Science

Page 21: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

21

DEEP LEARNING FOR SEISMIC EVENTS

Detecting earthquakes from seismic data [Perol, et al]

20x improvement in detection vs manual.

Orders of magnitude faster

AI for Earth Science

Page 22: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

22

计算物理

Page 23: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

23

DEEP LEARNING IN HIGH ENERGY PHYSICS - CERN

AI for Computational Physics

Challenges:

• HL-LHC (High-Luminosity Large Hadron Collider) project, the ever increasing event complexity

• Model Independent Searches

Deep Learning Solutions for:

• Triggering on rare signals

• Faster data processing and simulation

• Pattern recognition to extract physics content

• Lower Energy Computation

• Unsupervised Learning to Search for New Physics

Page 24: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

24

TOWARDS AN AI PHYSICIST FOR UNSUPERVISED LEARNING

AI for Computational Physics

AI Physicist agent architecture, Tailin Wu, Max Tegmark, MIT Experiment: Training environment for AI physicist

Theories

Divide and Conquer

Occam’s Razor

Unification

Lifelong Learning

Page 25: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

25

DEEP LEARNING AND THE SCHRODINGER EQUATION

AI for Computational Physics

ConvNN is used to be trained for solving Schrodinger equation.

ConvNN is trained with simulation data to predict the ground-state energy of an electron in four classes of confining two-dimentional electrostatic potential

Deep learning and the Schrodinger equation

Page 26: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

26

DEEP LEARNING FOR GRAVITATIONAL WAVE DETECTION

AI for Computational Physics

Deep learning method named deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering, a convolutional neural network to replace matched filtering. It provided 20X speed up on single core and potential to be accelerated further with GPU.

Gravitational wave due to black hole collide and merge

LIGO facility

To be observed

Actual Signal Caused by Gravitational Wave Actual observed data

How to findThe signal???

Deep Learning

Page 27: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

27

计算化学

Page 28: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

28

SHARING AND USING DATASETS FOR DRUG DISCOVERY

AI for Computational Chemistry

• DeepChem: Open Source framework for drug discovery using deep learning

• MoleculeNet: System for using/benchmarking using DeepChem – The “ImageNet” of Chemistry

• “Smart” splitters for training/validation/testing

• 17 curated datasets containing > 700,000 compounds

• Selection of featurizers and models

Page 29: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

29

CHEMCEPTION: THE QC VERSION OF INCEPTION

AI for Computational Chemistry

• Goh, et al. devised a CNN called Chemception that can perform all predictive requirements (toxicity, activity, solvation) as well as current expert QSAR/QSPR for a complex molecule (HIV) after being trained for only 24 hours on a single GTX 1080

Page 30: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

30

PREDICTING MD ENERGIES

AI for Computational Chemistry

• Schutt, et al. devised a DTNN that can calculate the chemical space for a medium-sized molecule with an max error of 1 kcal/mol.

Page 31: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

31

生命科学

Page 32: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

32

EXPLOSION IN GENOME DATA

Deep Learning for Genomics

MinIon SmidgION

Page 33: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

33

DECODE THE HUMAN GENOME BY DEEP LEARNING

Decoding DNA words and grammars that specify tissue-specific control elements

Deep Learning for Genomics

‘Motif Discovery’

Regulatory proteins bind DNA words (landing pads) in control elements!

source from Anshul Kundaje’s presentation online

Page 34: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

34

CONVOLUTIONAL NEURAL NETWORKS (CNN)

Deep Learning for Genomics

source from: http://cs231n.github.io/convolutional-networks/#overview

Page 35: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

35

CONVOLUTIONAL NEURAL NETWORKS (CNN)

Deep Learning for Genomics

source: deeplearning.standford.edu

Page 36: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

36

DEEP CNN ON DNA SEQUENCE INPUTS

Deep Learning for Genomics

source from Anshul Kundaje’s presentation online

Page 37: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

37

MULTI-TASK DEEP CNNS LEARN DISCRIMINATIVE DNA WORD PATTERN DETECTORS

Deep Learning for Genomics

source from Anshul Kundaje’s presentation online

Page 38: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

38

IDENTIFY IMPORTANT PARTS OF THE INPUT SEQUENCES

Efficient “Backpropagation” based approaches

DeepLIFT identifies combinatorial grammars of DNA words defining tissue-specific control elements!

Deep Learning for Genomics

Shrikumar et al.

https://arxiv.org/abs/1704.02685

CODE:

https://github.com/kundajelab/deeplift

source from Anshul Kundaje’s presentation online

Page 39: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

39

DEEP CNNS CAN PREDICT AND INTERPRET EFFECTS OF DISEASE-ASSOCIATED GENETIC VARIANTS IN RELEVANT TISSUE CONTEXT

Deep Learning for Genomics

source from Anshul Kundaje’s presentation online

Page 40: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

40

FUTURE OF PERSONALIZED MEDICINE

Deep Learning for Genomics

source from Anshul Kundaje’s presentation online

Page 41: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

41

DEEPBIND

Deep Learning for Genomics

DeepBind’s input data, training procedure and applications

Predicting the sequence specifies of DNA- and RNA-binding proteins by deep learning

Alipanahi Babak et al. Nature Biotechnology

Page 42: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

42

DEEPBIND

Deep Learning for Genomics

Predicting the sequence specifies of DNA- and RNA-binding proteins by deep learning

Alipanahi Babak et al. Nature Biotechnology

Page 43: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

43

DEEPVARIANT

Deep Learning for Genomics

A universal SNP and small-indel variant caller using deep neural networks

Ryan Poplin et al. Nature Biotechnology

Page 44: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

44

DEEPSEA

Deep Learning for Genomics

Predicting effects of noncoding variants with deep learning–based sequence model

Jian Zhou et al. Nature Methods

Page 45: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

45

IMAGE & LANGUAGE: IMAGE CAPTIONING

source: Karpathy & Fei-Fei, CVPR 2015

Deep Learning for Proteomics

Page 46: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

46

De novo Peptide Sequencing by Deep LearningPAGE 46

Neural Networks and Deep LearningDE NOVO PEPTIDE SEQUENCING BY DEEP LEARNING

Deep Learning for Proteomics

Page 47: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

47

NEURAL NETWORKS FOR SEQUENCE DATA

Deep Learning for Proteomics

Page 48: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

48

RECURRENT NEURAL NETWORKS (RNN)

Deep Learning for Proteomics

Page 49: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

49

LONG SHORT-TERM MEMORY (LSTM) NETWORKS

Deep Learning for Proteomics

Page 50: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

50

DE NOVO PEPTIDE SEQUENCING BY DEEP LEARNING

Deep Learning for Proteomics

Spectrum Captioning?

Ngoc Hieu Tran, Xianglilan Zhang, et al. University of Waterloo

Page 51: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

51

DE NOVO PEPTIDE SEQUENCING BY DEEP LEARNING

Deep Learning for Proteomics

DeepNovo Architecture

• ion-CNN, spectrum-CNN, LSTM

• Knapsack Dynamic Programming

• Peptide mass, prefix mass, suffix mass

• All amino acid combinations for a given total mass

• Filter out amino acids with unsuitable mass

• Beam Search: explore a fixed number of top

candidate sequences at each iteration.

• Bi-directional Sequencing

Ngoc Hieu Tran, Xianglilan Zhang, et al. University of Waterloo

Page 52: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

52

DeepNovo: First deep-learning application to de novo peptide sequencing

Architecture: CNN + LSTM + Knapsack dynamic programming

Major improvement of accuracy over the state-of-the-art:

7.7–22.9% at the amino acid level

38.1–64.0% at the peptide level

Enable complete assembly of novel protein sequences without assisting databases

Re-trainable & complete end-to-end training and prediction

High-performance computing with GPUs and CPUs

De novo Peptide Sequencing by Deep LearningPAGE 52

DE NOVO PEPTIDE SEQUENCING BY DEEP LEARNING

Deep Learning for Proteomics

Ngoc Hieu Tran, Xianglilan Zhang, et al. University of Waterloo

Page 53: 人工智能在科学计算中的应用调研 · deep filtering was used in the first detection of gravitational wave. Numerical simulated data was used for training deep filtering,

53

谢谢

Jeff AdiePrincipal Solution Architect

Singapore

Simon SeeDirector,

Solution Architect

致谢