s4ds newsletter · education, and training. build alliances with industry partners to promote...
TRANSCRIPT
Dear Friends, It gives me immense pleas-ure to present the first news-letter of the Society for Data Science…
S4DS is a non-profit profes-sional association to create a collaborative platform for bringing together technical experts across Industry, Academia, Government Labs and Professional Bod-ies to promote Innovation around Data Science. It is recognized as a “one of its kind” initiative rallying the technical community around an INDIA FIRST agenda. SD4S welcomes all who are interested in data science,
data mining, statistical mod-eling, big data analytics, machine learning, deep learning, artificial intelli-gence and the art of data visualization to turn it into a valuable asset for your aca-demic researches and data science careers. We are committed to serves mem-bers, improving the data science profession, eliminat-ing bias and enhancing di-versity, and advancing ethi-cal data science throughout the world.
We are happy to announce the flagship conference of S4DS, the biggest Data Sci-ence Conference in South-east Asia, which is
the International Confer-ence on Data Manage-ment, Analytics & Innova-tion (ICDMAI). ICDMAI was organized in the consecutive years of 2017 & 2018 in the IT city of Pune, India. The 3rd version in 2019 of ICDMAI will be hosted at Lincoln University College,
Malaysia and we are plan-ning to make it a big and an intellectually rich confer-ence.
This year, the main focus would be on strengthening the student community and enhancing the industry par-ticipation in S4DS activities across globe. Therefore, lets be proactive and supportive to everybody we are associ-ated with and to the organi-zation we are working for. You must resolve to take at least some time to apply our skills and strengths in bring-ing positive difference in the community. Let's begin our journey together with a PURPOSE…
Dr. Neha Sharma
Editor, S4DS Newsletter Secretary, S4DS
Editor’s Note
Mission of S4DS
Enriching the knowledge of students, researchers, companies, governments and
sponsors to structure their career to the glorious future in the field of Data Science.
Creating a pool of distinguished experts to transform all fields, professions, and sec-tors through the application of data science
Enhancing the academic ability to solve complicated business and societal problems through the analysis of large data sets
Adopting the unique multidisciplinary approach where domain specific knowledge is used to truly understand the power and limitations of data.
Ensuring the responsible use of data for the benefit of the society.
S4DS Newsletter
Jan—Jun 2018
Volume 1, Issue 1
INSIDE THIS ISSUE:
Editor’s Note 1
Vision & Mission of
S4DS
1
Execom Members of
S4DS
2
Objective of S4DS 2
Data Science Reality
at Present Arena
3
Membership of S4DS 8
S4DS Focus Areas 8
Establish a S4DS Stu-
dent Chapter
8
Vision of S4DS
Empowering people in the field of Data Science through spread of knowledge and wis-dom in an intelligent environment to im-prove life, business and government and prepare tomor-row’s data science researchers to ad-dress grand chal-lenge problems
Society for Data Science
Tech-empowering' India's need for Data Scientists
try partnerships for research, education, and training.
Build alliances with industry partners to promote beneficial relationships towards develop-ing solutions for practical prob-lems related to data science.
Seek to commercialize its tech-nology, methods, and patents related to data science to deliv-er solutions that will benefit industry and society at large.
Develop outreach programs to the community by creating new opportunities from data science.
Support a small numbers of new “Centers of Excellence” in the domain of data science.
Develop a Think Tank to advise and assist Government of In-
dia / state Governments and other such agencies for all data intensive projects with possible Data Science Solutions.
Provide an avenue for Data Science Certification.
Publish a Newsletter / Maga-zine/ Journal.
Organize Seminars and confer-ences regularly at different locations in the country as well as abroad.
Recognize the exemplary work done in the field of Data Science, Innovation and Start-ups
Setting standards for the ethical professional practice of data science.
Helping to shape a better future - not just for the powerful, but for the majority of people.
Provide a forum for interchange of ideas on Data Science among students, researchers, companies, governments along with other relevant stake hold-ers.
Promote and create aware-ness about the Data Science in the rapidly-changing world, as an area where there is lots of opportunities for placement and research, in India and worldwide by connecting indi-viduals, corporations and data scientists.
Gather data science enthusi-asts and empower them to be ready for data science careers through hands-on practical workshops, trainings, public events, and distinguished guest lectures, to assure base-level data scientist competency.
Provide an enjoyable and inter-active platform for real-life ap-plications of data analysis.
Advocate adoption of Data Sci-ence solutions for all most all the domains, with a special focus on social problems.
Develop and deliver education-al initiatives in the domain of data analytics and data scienc-es to support the next genera-tion of data scientists.
Develop government and indus-
Objectives of S4DS
Executive Committee of S4DS
Role of a data scientist has already
earned the moniker of “the hottest
job of the 21st century”. According
to a report by the McKinsey Global
Institute, there will be a shortage
of 140,000 to 190,000 data
science professionals by 2018 in
USA alone.
S4DS NEWSLETTER Page 2
Photo Name Designation
Dr. Amol Chandrabhan Goje President
Dr. Neha Sharma Secretary
Dr. Inderjit Singh Barara Member
Mr. Atul Bengiri Member
Photo Name Designation
Dr. Amlan Chakrabarti Vice President
Dr. Deepali Sawai Treasurer
Dr. D. H. Manjaiah Member
Dr. Saptarsi Goswami Academia –Industry
Alliance Chair
NASSCOM pegs revenues from data
science and AI (IT and non-IT
industries) in the country at $16
billion by 2025, providing jobs to
1.50 lakh professionals.
VOLUME 1, ISSUE 1
Definition of Data
Science:
Data science is an interdisciplinary
field of scientific methods, pro-
cesses, algorithms and systems to
extract knowledge or insights
from data in various forms, either
structured or unstructured, similar
to data mining.
The role of a data scientist is nor-
mally associated with tasks such as
predictive modeling, developing
segmentation algorithms, recom-
mender systems, A/B testing frame-
works and often working with raw
unstructured data.
The nature of their work demands a
deep understanding of mathemat-
ics, applied statistics and program-
ming. There are a few skills com-
mon between a data analyst and a
data scientist, for example, the abil-
ity to query databases. Both ana-
lyze data, but the decision of a data
scientist can have a greater impact
in an organization.
Here is a set of skills a data scien-
tist normally need to have:
Programming in a statistical pack-
age such as: R, Python, SAS,
SPSS, or Julia
Able to clean, extract, and ex-
plore data from different sources
Research, design, and implemen-
tation of statistical models
Deep statistical, mathematical,
and computer science knowledge
In big data analytics, people nor-
mally confuse the role of a data
scientist with that of a data archi-
tect. In reality, the difference is
quite simple. A data architect de-
fines the tools and the architecture
the data would be stored at, where-
as a data scientist uses this archi-
tecture. Of course, a data scientist
should be able to set up new tools if
needed for ad-hoc projects, but the
infrastructure definition and design
should not be a part of his task [1].
The importance of
data science :
Data science has over the past few
years come a really long way. That
is why they are integral part of un-
derstanding the working of many
industries, however complex and
intricate.
Here are ten reasons why data sci-
ence will always remain an integral
part of the culture and economy of
the global world [2]:
Data Science Reality at Present Arena
Page 3
Dr. MANJAIAH D. H
PROFESSOR & CHAIRMAN
DEPARTMENT OF PG STUDIES AND RESEARCH IN COMPUTER SCIENCE
MANGALORE UNIVERSITY, MANGALAGANGOTRI
MANGALORE: 574 199. INDIA.
9449444638 [M] / 0824 - 2888646 ( O )
[email protected] / [email protected]
Fig1: Structure of Data Science
1. Data science helps brands to understand their customers in a much enhanced and empowered manner.
Customers are the soul and base of any brand and have a great role to play in their success and failure. With
the use of data science, brands can connect with their customers in a personalized manner, thereby ensuring
better brand power and engagement.
2. One of the reasons why data science is gaining so much of attention is because it allows brands to com-
municate their story in such an engaging and powerful manner. When brands and companies utilize this data
in a comprehensive manner, they can share their story with their target audience, thereby creating better
brand connect. After all, nothing connects with consumers like an effective and powerful story that can incul-
cate all human emotions.
3. Big Data is a new field that is constantly growing and evolving. With so many tools being developed, al-
most on a regular basis, big data is helping brands and organizations to solve complex problems in
IT, human resource, and resource management in an effective and strategic manner. This means effective
use of resources, both material and non-material.
4. One of the most important aspect of data science is that its findings and results can be applied to almost
any sector like travel, healthcare and education among others. Understanding the implications of data sci-
ence can go a long way in helping sectors to analyses their challenges and address them in an effective
fashion.
5. Data science is accessible to almost all sectors. There is a large amount of data available in the world
today and utilizing them in a proper manner can spell success and failure for brands and organizations. Utiliz-
ing data in a proper manner will hold the key for achieving goals for brands, especially in the coming times.
That being said, data science is taking on a big and prime role in functioning and growth process of brands.
Being a data scientist is therefore a prime position for any person as they have the big task of managing data
and providing solutions for their problems, both within and outside the organization.
Applications / Uses of Data Science
Using data science, companies have become intelligent enough to push & sell products as per customer’s purchasing
power & interest. Here’s how they are ruling our hearts and minds [3]:
1. Internet Search
When we speak of search, we think ‘Google’. Right? But
there are many other search engines like Yahoo, Bing, Ask,
AOL, Duckduckgo etc. All these search engines (including
Google) make use of data science algorithms to deliver the
best result for our searched query in fraction of seconds.
Considering the fact that, Google processes more than 20
petabytes of data every day. Had there been no data sci-
ence, Google wouldn’t have been the ‘Google’ we know to-
day.
2. Digital Advertisements (Targeted Advertising and re-
targeting)
If you thought Search would have been the biggest application of data science and machine learning, here is a challeng-
Data Science Reality at Present Arena Contd...
S4DS NEWSLETTER Page 4
VOLUME 1, ISSUE 1
er – the entire digital marketing spectrum. Starting from the display banners on various websites to the digital bill boards
at the airports - almost all of them are decided by using data science algorithms. This is the reason why digital ads have
been able to get a lot higher CTR than traditional advertisements. They can be targeted based on user’s past behaviour.
This is the reason why I see ads of analytics trainings while my friend sees ad of apparels in the same place at the same
time.
3. Recommender Systems
Who can forget the suggestions about similar products on
Amazon? They not only help you find relevant products from
billions of products available with them, but also adds a lot
to the user experience. A lot of companies have fervidly
used this engine / system to promote their products / sug-
gestions in accordance with user’s interest and relevance of
information. Internet giants like Amazon, Twitter, Google
Play, Netflix, Linkedin, imdb and many more uses this sys-
tem to improve user experience. The recommendations are
made based on previous search results for a user.
4. Image Recognition :
You upload your image with friends on Facebook and you
start getting suggestions to tag your friends. This automatic
tag suggestion feature uses face recognition algorithm. Sim-
ilarly, while using whatsapp web, you scan a barcode in your web browser using your mobile phone. In addition, Google
provides you the option to search for images by uploading them. It uses image recognition and provides related search
results.
5. Speech Recognition :
Some of the best example of speech recognition products are Google Voice, Siri, Cortana etc. Using speech recognition
feature, even if you aren’t in a position to type a message, your life wouldn’t stop. Simply speak out the message and it
will be converted to text. However, at times, you would realize, speech recognition doesn’t perform accurately.
6. Gaming :
EA Sports, Zynga, Sony, Nintendo, Activision-Blizzard have led gaming experience to the next level using data science.
Games are now designed using machine learning algorithms which improve / upgrade themselves as the player moves
up to a higher level. In motion gaming also, your opponent (computer) analyzes your previous moves and accordingly
shapes up its game.
Challenges of Data Science
It is becoming increasingly apparent that data scientists need to demonstrate skills necessary to convert data-based sci-entific inference into accessible, actionable insights for business and upper level management. Today's data scientists need to both straddle the worlds of business boardrooms and IT as well as become a hybrid of them. But does their soft-ware support them in achieving these lofty goals? A data scientist worth his salt uses applications that help him surmount the three key challenges to his job [4]. 1. Multiple Data Sources The latent value of big data is best mined when data scientists can reach across the expanse of the data landscape and access data from multiple platforms and data sources. Deeper and often more meaningful insights can be gleaned, the more relevant data that the inquiries have at their disposal. With cloud-based, integrated data platforms like ClicData, virtual data warehouses can be 'built' that effectively connect data from numerous locations, arriving in a variety of for-mats, at different times, captured both in batch and in real-time. This more inclusive reach means more useful inferences and insights. 2. Customers Insist on Interacting The new data scientist needs to go beyond delivery of historically-driven reports and provide actionable answers in envi-ronments that give the customer control. ClicData full-featured dashboards allow data scientists to prioritize the metrics and indicators that are relevant to the strategic goals and objectives of the business, and communicate them in the lan-guage of C-level stakeholders. As such the work they do has a direct and immediate impact on the business.
Page 5
3. Communicating with Real People
These days, data scientists must do more than understand their data; they need to make their data understood by oth-ers. The results of their work are used to resolve business problems, create an efficient supply chain, automate of oper-ations, nourish customer relationships, launch revenue lines, and establish strategic competitive advantages. Dashboard software like ClicData offers a wide range of visualization widgets to make the data meaningful and actionable, choosing the right tool to graphically display and convey the crucial and supportive insights that are needed. Communication becomes automated with the ability to distribute results, reports and performance indicators to chosen groups or users. Notifications and alerts are set up according to predetermined conditions. As the business model em-braces collaboration, these tools are essential to business-wide communication.
Reference
https://www.educba.com/data-science-and-its-growing-importance/
https://www.analyticsvidhya.com/blog/2015/09/applications-data-science/
https://www.kdnuggets.com/2016/02/clicdata-3-biggest-challenges-data-scientist.html
Data Science Reality at Present Arena Contd...
S4DS NEWSLETTER Page 6
Forth Coming Event
3rd International Conference on Data Management, Analytics & Innovation (ICDMAI)
The biggest Data Science Conference in Southeast Asia, which is the International Conference on Data Management, Analytics & Innovation (ICDMAI) was organized in the consecutive years of 2017 & 2018 in the IT city of Pune, India. The 2019th version of ICDMAI will be hosted at Lincoln University College, Malaysia and we are planning to make it a big and an intellectually rich conference. Many eminent speakers are associated with ICDMAI 2019 and would share the plethora of knowledge during the conference: 1. Dr. P. K. Sinha, Vice Chancellor and Director, Dr S P Mukherjee International Institute of Information Technology, Naya Raipur (IIIT-NR), Chhattisgarh 2. Prof. Vincenzo, Piuri, Professor, University of Milano, Italy, IEEE Fellow 3. Prof. Janusz Kacprzyk, Professor, Systems Research Institute, Polish Academy of Sciences, Warsaw, Poland 4. Prof. Juergen Seitz, Head of Business Information Systems Department, Baden-Wuerttemberg Cooperative State Univer-sity, Heidenheim, Germany 5. Prof. Jan Martinovic, Head of Advanced Data Analysis and Simulations Lab at IT4 Innovations National Supercom-puting Centre of the Czech Republic, VŠB Technical University of Ostrava 6. Dr. Valentina Balas, Professor, “AurelVlaicu” University, Arad, Romania 7. Ms. Kranti Athalye, Sr. Manager, University Relations, IBM India Pvt.Ltd 8. Mr. Aninda Bose, Senior Publishing Editor, Springer India Pvt. Ltd. 9. Col. Inderjit Singh Barara, Technology Evangelist, Solution Architect and a Mentor
Technical Program Chairs
Neha Sharma Professor, Vidya Pratishthan’s Institute of Information Technology, (VIIT, Baramati, Pune), Savitribai Phule Pune University, India Amlan Chakrabarti Dean, Faculty of Engineering, University of Calcutta Director and Professor, A.K.Choudhury School of Information Technology, Kolkata Valentina Emilia Balas Professor, Department of Automatics and Applied Software faculty of Engineering, University of Arad, Romania
VOLUME 1, ISSUE 1
International Liason Chairs
Juergen Seitz Head of Business Information Systems Department, Baden-Wuerttemberg Cooperative State University, Heidenheim, Germany
Jan Martinovic Head of Advanced Data Analysis and Simulations Lab IT4 Innovations National Supercomputing Centre of the Czech Republic, VŠB Technical University of Ostrava
Industry Liason Chair
Col. Inderjit Singh Barara Technology Evangelist, Solution Architect and a Mentor
Tutorial and Workshop Chairs
Saptarsi Goswami A.K.Choudhury School of IT, University of Calcutta
Page 7
Sruti Das Choudhury University of Nebraska-Lincoln (UNL), USA
Ozen Ozer
Kirklareli University, Turkey
Publicity Chairs
Priyadarshi Kanungo, C.V. Raman College of University, Bhubaneswar
Jyoti Gautam, J.S.S. Academy of Technical Education, Noida
Manjaiah D. H. Professor, Department of Computer Science, Mangalore
University
Membership is open to data scientists, scientists, students, academicians, and others interested in science and the data science profession. Membership is available under three categories namely Academic Professional and Corporate Professional and Student. Membership Benefits
Free access to quarterly newsletter containing cutting edge updates in this domain
Discounted rate on workshops and Certification programmes on disruptive technologies, conducted by experts
Discounted registration fee to the flagship International Conference on Data Management, Analytics and Innovation (ICDMAI)
Access to global list of mentors, especially for students to jointly undertake writing of papers, developing projects and open source packages
Funding for quality student member’s innovative ideas
Opportunity to deliver workshop/training in collaboration with Data Science Society (S4DS)
Opportunity for networking with the S4DS members
Membership of S4DS
Dr. Neha Sharma,
Editor, S4DS Newsletter Secretary, S4DS
Phone: 9923602490 Email: [email protected]
Website: www.s4ds.org
Organization
SOCIETY FOR DATA
SCIENCE
Student Chapter can be established at any institution with 12 Student members and 03 Professional members. Benefits of Students Chapter
Organizing events at S4DS Student Chapter shall help members develop administrative, technical, managerial and humanitarian skills.
Establish center of excellence on many disruptive technologies.
Conduct Certification Programmes on disruptive technologies.
Resource and promotional support for Workshop, Lectures, Seminars, Conferences etc.
Organize Faculty Development Programmes and Technical Training Programmes.
Opportunity to organize activities to develop entrepreneurial skills, leadership skills, soft skills and other skill development activi-ties.
Opportunity to host or become partner in organizing the International Conference on Data Management, Analytics and Innovation (ICDMAI), a flagship conference of S4DS
Networking with S4DS team and other Student Chapters, may help in projects and placements.
Association with Technical Professional Body shall help in receiving recognition by Accreditation bodies.
Establish a S4DS Student Chapter