scaling cloud web & data technologies
www.anant.us | [email protected] | 202.905.2818
1010 Wisconsin Ave, NW | Suite 250 | Washington, DC 20007
Streamline, Organize & Unify Business Information
Scaling Cloud Web & Data Technologies
Distributed Platform: Spark, Cassandra, Kafka, Docker
Agenda
• Who are these guys? (Intro)
• Why Distribute Everything?
• Spark: Distributed Computing
• Cassandra: Distributed Data
• Kafka: Distributed Queues
• Docker: Simplified Code Distribution
• Questions?
Who am I. What I did. What I do.
Built a Hosting Company
Web Application Dev/Hosting
IT Consulting w/ Firms/Agencies
Management Consulting for Internet Teams + Difficult Data
What we do well.
Streamline, Organize & Unify
Business Information
What is Business Information
Internal vs. External (Public vs. Private)
Strategic / Tactical / Operational
Data / Information / Knowledge
Documents / Records / Metrics
Project Information
Client Service Information
Corporate Guides
Collaborative Documents
Assets & Files
Corporate Assets
Phases of Business Technology
Business Silos
Standardized Platform
Optimized Core
Business Modularity
Your Business Cloud is Disconnected
… so you and your team waste time ...
Bring it all together in One Place
Project Information
Client Service Information
Corporate Guides
Collaborative Documents
Assets & Files
Corporate Assets
Konotree
We will help you connect and organize your data.
find. analyze. act.
FIND: DISCOVER, FILTER
ANALYZE: VISUALIZE, MEASURE
ACT: USE, SHARE
Why Distribute Everything?
• Pros
– Scale
– Speed
– Concurrency
– Volume
– Modularity
• Cons
– Complexity
– Latency
– Parity
Systems Amateurity
• Gonna host a ..:
– Project Management System ?
– Chat Server ?
– Frontend / Backend Framework ?
– Blog System / CMS / eCommerce ?
– Social Network ?
Systems Amateurity
• Equivalent to restaurants making their own pots and pans
• Common in many teams - small to large
• “I can SELF-HOST EVERYTHING”
Kafka
• 20-30 Times More Throughput
• PubSub or Message Queue
• Persistent w/ Offset (Replays)
• Messaging / ESB / Integration
• Seen in real world ~ 15TB / Day
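The "Persistent w/ Offset (Replays)" point can be illustrated with a toy model. This is a minimal in-memory sketch in plain Python, not Kafka's client API: ToyLog and ToyConsumer are hypothetical names invented for illustration. It assumes only the general idea that records stay in an append-only log after they are read, and that each consumer tracks its own position (offset) in that log.

```python
from dataclasses import dataclass, field


@dataclass
class ToyLog:
    """Append-only log (hypothetical stand-in for one topic partition)."""
    records: list = field(default_factory=list)

    def append(self, value):
        """Store a record and return the offset it was written at."""
        self.records.append(value)
        return len(self.records) - 1

    def read_from(self, offset, max_records=100):
        """Return records starting at offset; reading never deletes them."""
        return self.records[offset:offset + max_records]


@dataclass
class ToyConsumer:
    """Consumer that remembers how far it has read (hypothetical)."""
    log: ToyLog
    offset: int = 0

    def poll(self):
        """Read everything after the current offset and advance it."""
        batch = self.log.read_from(self.offset)
        self.offset += len(batch)
        return batch

    def seek(self, offset):
        """Move the position back (or forward) to re-read records."""
        self.offset = offset


log = ToyLog()
for event in ["signup", "login", "purchase"]:
    log.append(event)

consumer = ToyConsumer(log)
first_pass = consumer.poll()   # reads all three records
consumer.seek(1)               # rewind to offset 1
replayed = consumer.poll()     # re-reads from "login" onward
```

Because the log keeps its records, rewinding the offset is all a consumer needs to do to replay history; a traditional queue that deletes messages on delivery cannot offer this.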
Spark
• Extremely Fast Processing
• Program in Java, Scala, Python, R, or C#
• Sophisticated Algorithms Available: PageRank in 5 lines.
• Real Time Stream Processing
• Awesome Integration with Hadoop or Other Data Sources
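To give a feel for why PageRank fits in a handful of lines, here is a minimal sketch in plain Python rather than Spark itself. It mirrors the usual dataflow (each page splits its rank among its links, contributions are summed per page, then a damping factor is applied); the function name, the sample graph, and the 0.85 damping value are assumptions chosen for illustration. In Spark the same steps would run as distributed transformations over a cluster instead of loops over a dictionary.

```python
from collections import defaultdict


def pagerank(links, iterations=10, damping=0.85):
    """links maps each page to the list of pages it links to."""
    ranks = {page: 1.0 for page in links}
    for _ in range(iterations):
        contribs = defaultdict(float)
        # Each page splits its current rank evenly across its outgoing links,
        # and the contributions arriving at each page are summed.
        for page, neighbors in links.items():
            for neighbor in neighbors:
                contribs[neighbor] += ranks[page] / len(neighbors)
        # Apply damping to get the next round of ranks.
        ranks = {page: (1 - damping) + damping * contribs[page]
                 for page in links}
    return ranks


# Hypothetical three-page graph.
graph = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
ranks = pagerank(graph)
```

The core of the algorithm is the three statements inside the loop, which is roughly what the "5 lines" claim refers to.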
Cassandra
• Massive Amounts of Data Quickly
• Distributed Architecture - Masterless (any node can accept reads and writes)
• Tunable Consistency - CAP
• CQL ~ SQL - Familiar / Similar Syntax
• Easily Scale DB with Additional Nodes
• Easily add Replicas in additional DCs
• Hybrid of Schema / Schema-less
• Low Barrier to Entry
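The "Tunable Consistency" bullet comes down to simple arithmetic: a read is guaranteed to see the latest acknowledged write when the number of replicas read plus the number of replicas written exceeds the replication factor. The sketch below works that out in plain Python for a replication factor of 3, which is an assumed example value. ONE, QUORUM, and ALL are Cassandra consistency levels; the helper names are hypothetical.

```python
def quorum(replication_factor):
    """Majority of replicas: floor(RF / 2) + 1."""
    return replication_factor // 2 + 1


def overlaps(replication_factor, read_replicas, write_replicas):
    """True when every read set must share a replica with every write set."""
    return read_replicas + write_replicas > replication_factor


rf = 3  # assumed replication factor for this example
levels = {"ONE": 1, "QUORUM": quorum(rf), "ALL": rf}

# Map (write level, read level) to whether reads always see the latest write.
table = {(w, r): overlaps(rf, levels[r], levels[w])
         for w in levels for r in levels}

for (w, r), strong in sorted(table.items()):
    print(f"write={w:<6} read={r:<6} reads see latest write: {strong}")
```

Writing and reading at QUORUM overlaps while tolerating one node down; ONE/ONE is faster but only eventually consistent. That is the trade-off being tuned per query.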
Docker
• Derivable Images
• Immutable Containers
• Same Docker works Everywhere
• Same Compose works Everywhere
• Industry Wide Adoption
• Thousands of Docker Images Available
• Thousands of Compose Files Available
• Runs on Linux or Windows (Natively)
• Tested with * on Linux
• Tested with DotNet core on Linux
• Tested with Legacy .NET/MSSQL on Windows
Docker Makes it Easy to Self Host
Appleseed Framework
What we do well.
Streamline, Organize & Unify
Business Information
Ongoing Subscription + Kickstart
KICKSTART*
- Discovery*
- Workshop
- Architecture*
- Review
- Engineering*
- Delivery
SUBSCRIPTION
- Support
- Advisory / Guidance
- Maintenance*
- Config / Customization
- Managed*
- Setup / Train / Admin
*service options
Free Consultation
Streamlined Data: Integration / Data Pipelines
Organized Knowledge: Search / Data Warehouses
Unified Interfaces: Portals / Dashboards / Mobile
Free Consultation