sas: open for business (and learning) group presentation… · visual analytics –register for...
TRANSCRIPT
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
SAS: OPEN FOR BUSINESS (AND LEARNING)
MARK MORREALE,
NATIONAL LEAD, ACADEMIC PROGRAM
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
VISUAL ANALYTICS &
VISUAL STATISTICS 14-DAY FREE CLOUD TRIAL, UP TO 5 USERS
Your Trial, Your Data
Visual Analytics – Register for Trial• Smart data exploration with self-services analytics makes this product usable for anyone.
Interactive reporting makes it collaborative. Scalability and governance make it fit the needs of
your organization, no matter the size.
Visual Statistics – Register for Trial• Multiple users can explore and visualize data, then interactively create and refine descriptive
and predictive models. Distributed, in-memory processing reduces model development time
so you can run complex analytic computations – and get precise results – in minutes.
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
FRIENDS OF SAS
Questions? contact [email protected]
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
ACKNOWLEDGEMENTS
MATT MALCZEWSKI, SAS COMMUNITIES
TAMARA DULL, SAS BEST PRACTICES
STEVE HOLDER, NATIONAL ANALYTICS LEAD, SAS CANADA
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
SAS & OPEN MARK’S THREE POINTS
EMBRACEopen source by including it
and leveraging it where we
can
EXTENDopen source by improving
its interoperability and
utility for the enterprise
SAS
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
MythCommercial software companies are afraid of open source.
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
OPEN SOURCE
CERTIFICATION
Open source is the software for which the
original source code is made freely available
with an open source license for redistribution
and modification by anyone.
The pre-written programs or functions can
be used as is or changed to fit the user
specific need.
WHAT IS OPEN SOURCE?
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
OPEN SOURCE HAS BEEN AROUND FOR A LONG TIME
20152010200520001995199019851980
Python
Apache
Hadoop
Apache
Spark
Redhat Linux,
MySQL
R
Apache
Tomcat
Linux
Scala
Apache
Pig
Tensorflow
GitHub
Repository for
open source projects
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
OPEN SOURCE COMMERCIAL SOURCED
AT
AA
NA
LY
SIS
ENVIRONMENTSOPEN SOURCE
TECHNOLOGIES
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
Proprietary Open Source
WHO?
WHAT?
WHERE?
WHEN?
WHY?
HOW
MUCH?
paid employees, contractors led by
R&D, product management
volunteer group of peer developers
collaborating
software is under
restrictive copyright, source code is
usually hidden from users
source code is available for modification or
enhancement by anyone
companies, organizations foundations, dev communities
customer demand, market conditions developer(s) see/respond to a need
to make money, part of IP to give back
license, subscription model – $0 to $$$;
dev, support – $0 to $$$
software - $0;
dev, support - $0 to $$$
PROPRIETARY SOFTWARE COMPARED
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
COMPARISONS
• Open Source Offers:
+A robust online community.
+An extensive array of algorithms.
+Low cost barriers to entry.
+Fast adoption of new innovation.
• SAS Offers:
+Productivity for users regardless of
skillset.
+Scalability to address any problem or
dataset.
+Governed analytics and data.
+The support organizations require for
production and operational analytics.
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
HOW DO I PARTICIPATE IN OPEN SOURCE
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
I’M OPEN BECAUSE I’M ON GITHUB!
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
OPEN SOURCE
CERTIFICATION
• A web-based open source version control system that provides a repository for hosting open
source software projects.
• Users are able to discuss, manage, create repositories, submit contributions to others'
repositories, and review changes to code.
• Repositories can be public or private.
• There are more than 14 million users and 35 million repositories making it the largest host of
source code in the world.
WHAT IS GITHUB?
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
https://github.com/sassoftware
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
SAS ON GITHUB HOW SAS CONTRIBUTES
Dozens of SAS employees involved in these types of projects:
• Examples for learning
• Open source enhancements for SAS software
• Tools for software developers
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
#1 -- SAS EMBRACES OPEN SOURCE
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
SAS IS OPEN LET’S DEVELOP LIKE IT’S 1999
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
SAS UNIVERSITY
EDITION
Partial list of Open Source
• CentOS Linux
• Dojo Toolkit
• Apache Velocity scripting
• rPath packaging tech
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
LUA AND GROOVY
AND C3RD PARTY LANGUAGE INCLUSION
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
R AND JAVA 3RD PARTY LANGUAGE INTEGRATION
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
MORE OPENNESS PARTIAL LIST OF OTHER OPEN TOUCHPOINTS
• Hadoop
• MapR
• Apache YARN
• Apache Camel
• PMML
• Open Street Map
• MySQL
• PostgreSQL
• Amazon Redshift
• LaTeX
• Excel (XLSX)
• PowerPoint (PPTX)
• RTF
• Groovy
• Lua
• R
• Java
• XML
• JSON
• HTML/HTML5
• REST (HTTP)
Some of these are
“open source”
tech; others are
just ubiquitous
integration points
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
#2 -- SAS EXTENDS OPEN SOURCE
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
EMBRACE
EMBRACESAS leverages open source technologies
seamlessly with SAS product offerings.
EXTENDSAS facilitates interoperability with open
source technologies to provide enhanced or
more complete enterprise capabilities
COMPETESAS offers a direct and more viable
alternative to open source technology
EMBRACE EXTEND
EXTEND COMPETE
OPEN SOURCE COMMERCIAL SOURCED
AT
AA
NA
LY
SIS
ENVIRONMENTSSAS & OPEN SOURCE
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
OPEN SOURCE
SOLUTIONSFUNCTIONALITY MAPPING
Copyr i g ht © 2016, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
SAS VIYA
31
SUPPORTING CURRENT INDUSTRY TRENDS
RESTful API’s
Multi-threaded hyper-computing
Memory spilloverScalable
Elastic
Easy installs
Charge-back capable
Advanced machine learning
Analytics lifecycle support
Integrated solutions
Micro-services
architecture
Plug n’ play
Python, Java, Lua support
Backward compatible ‘Any data, any platform’
End-to-end
Copyr i g ht © 2015, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
SAS VIYA COMMUNITIES.SAS.COM AND GITHUB
Copyr i g ht © 2016, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
FOR MORE INFORMATION
Empowering the SAS/IML user with the functionality of R
Documentation: IML User’s Guide - Calling Functions in the R Languagehttp://support.sas.com/documentation/cdl/en/imlug/66845/HTML/default/viewer.htm#imlug_r_toc.htm
Video: Calling R Procedures from SAS/IML® Softwarehttps://www.youtube.com/watch?v=rUaTTre24kI
Video: SAS/IML and R: Using Them Togetherhttps://www.youtube.com/watch?v=nmRQ3MtkG6A
Blogs: The DO Loop – R tagshttp://blogs.sas.com/content/iml/tag/r/
Paper (p 14-17): Rediscovering SAS/IML® Software: Modern Data Analysis for the Practicing Statisticianhttp://support.sas.com/resources/papers/proceedings10/329-2010.pdf
Article: Versions of R that are supported by SAS/IMLhttp://blogs.sas.com/content/iml/2013/09/16/what-versions-of-r-are-supported-by-sas.html
34
Copyr i g ht © 2016, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
Video: Using R in SAS Enterprise Minerhttps://www.youtube.com/watch?v=TbXo0xQCqDw
Blogs: Spectral Clustering in SAS® Enterprise Miner™ Using Open Source Integration Nodehttps://communities.sas.com/docs/DOC-8011
Blogs: How to execute a Python script in SAS® Enterprise Miner™https://communities.sas.com/docs/DOC-10832
Blogs: Open Source Integration Using the Base SAS Java Objecthttps://communities.sas.com/docs/DOC-10746
Article: The Open Source Integration node installation cheat sheethttps://communities.sas.com/docs/DOC-9988
Usage Notes: http://support.sas.com/dsearch?Find=Search&ct=&qt=open+source&col=suppprd&nh=25&qp=&qc=suppsas&ws=1&q
m=1&st=1&lk=1&rf=0&oq=&rq=0
FOR MORE INFORMATION - EXTENDING R
35
Copyr i g ht © 2016, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
FOR MORE INFORMATION MATERIALS ON GITHUB
36
Sas integration and sample codeIntegration with R, Pythonhttps://github.com/sassoftware/enlighten-integration
Integration with Jupyter Notebook and Pythonhttps://github.com/sassoftware/sas_kernel
https://github.com/sassoftware/saspy
Sample codes of SAS Machine Learning methodshttps://github.com/sassoftware/enlighten-apply
SAS Enterprise Miner process flow diagramshttps://github.com/sassoftware/dm-flow