Landscape Analytics

Uploaded by

Rajat

TRACXN REPORT : BIG DATA ANALYTICS

June 2016
Tracxn
World’s Largest Startup Research Platform

2 Big Data Analytics, June 2016


Contents
Sectors We Track
Overview
Tracxn BlueBox
Company List
Team



Illustrative Sectors We Track

ENTERPRISE INFRASTRUCTURE: SECURITY, STORAGE, NETWORKING, MOBILITY, IT OPS, CLOUD INFRASTRUCTURE, API MANAGEMENT, BIGDATA INFRASTRUCTURE
ENTERPRISE APPLICATIONS: SAAS, MOBILE-FIRST ENT. APPS, INTELLIGENT ENT. APPS, OPEN SOURCE, RETAIL TECH, MARKETING TECH, BIG DATA ANALYTICS, VERTICAL SAAS
CONSUMER: MARKETPLACES, SUBSCRIPTION COMM., FOOD TECH, INTERNET FIRST BRANDS, UBER FOR X, SECOND HAND GOODS, TRAVEL, GAMING
MOBILE: MOBILE COMMERCE, MOBILE PAYMENTS, MOBILE MARKETING, MOBILE DEV TOOLS, MOBILE HEALTH, MOBILE GAMING, MOBILE LEARNING, MOBILE COMMUNICATION
TECH: INTERNET OF THINGS, 3D PRINTING
FINTECH: BITCOIN, PAYMENTS
EDUCATION: EDUCATION IT, SELF LEARNING
HEALTHCARE: LIFE SCIENCES, DIGITAL HEALTH



Sector Overview
Scope of report
The report covers companies that provide Big Data Analytics solutions for data discovery, data integration, visualization, data science, and advanced predictive analytics. It excludes services firms and consultancies that use other companies' technology products for data analytics.

The report also covers companies that provide Big Data Analytics platforms for Heavy Industries, Finance, and other verticals.

The funding details in the report include venture capital and private equity funding rounds and exclude debt financing, grants, and prize money.



YoY – Number of companies founded

[Figure: chart of the number of companies founded per year, 2010 to 2015 (y-axis: No. of Companies). Companies called out on the chart: Alteryx, Looker, ThoughtSpot, BBD, Uptake, Ndustrial.io, DataSift, Platfora, Trifacta, Dato, UNIFi Software, Outlier.]



YoY – Number of Rounds and Total Funding
[Figure: combined chart of total funding amount (in $ millions, left axis) and number of funding rounds (right axis) per year, 2008 to 2016.]



YoY – Number of Rounds by Stage
[Figure: chart of the number of funding rounds per year, 2011 to 2016, split by stage: Seed, Early, Late.]



YoY – Funding by Stage
[Figure: chart of total funding amount (in $ millions) per year, 2011 to 2016, split by stage: Seed, Early, Late.]



Average Ticket Size – Early Stage
[Figure: chart of average round size (funding amount in $ millions) per year, 2011 to 2016, for Seed, Series A, and Series B rounds.]



Top investments in last one year
Company | Funding Amount | Round Name | Date | Investors
Palantir (palantir.com; Palo Alto, 2004) | $880M | PE | 12/01/2015 | Morgan Stanley, sfsentry.com, In-Q-Tel, Founders Fund, ARTIS Ventures
Alteryx (alteryx.com; Irvine, 2010) | $85M | Series C | 10/01/2015 | Iconiq Capital, Insight Partners, Meritech
ThoughtSpot (thoughtspot.com; Palo Alto, 2012) | $50M | Series C | 05/01/2016 | General Catalyst, Lightspeed Venture Partners, Khosla Ventures
Looker (looker.com; Santa Cruz, 2011) | $48M | Series C | 01/01/2016 | Kleiner Perkins, Redpoint, Meritech, Sapphire Ventures
Uptake (uptake.com; Palo Alto, 2014) | $45M | Series B | 10/01/2015 | New Enterprise Associates, Lightbank, General Purpose Vehicles, Caterpillar
Datameer (datameer.com; San Mateo, 2009) | $40M | Series E | 08/01/2015 | Top Tier Capital Partners, Software AG, Redpoint, Next World Capital, Kleiner Perkins, sttelemedia.com
SnapLogic (snaplogic.com; San Mateo, 2006) | $38M | Series E | 12/01/2015 | Microsoft, Silver Lake
Coveo (coveo.com; Quebec City, 2005) | $35M | Series D | 11/01/2015 | iqventureadvisors.com, Tandem, BDC, Fonds de solidarité FTQ, Telesystem



Top investments in last one year (continued)
Company | Funding Amount | Round Name | Date | Investors
Trifacta (trifacta.com; San Francisco, 2012) | $35M | Series D | 02/01/2016 | Accel Partners, Greylock, Ignition Partners, Cathay
DataRobot (datarobot.com; Boston, 2012) | $33M | Series B | 02/01/2016 | New Enterprise Associates, Accomplice, Intel Capital, A Ventures, Recruit Strategic Partners, New York Life Insurance Company
Attivio (attivio.com; Newton, 2007) | $31M | Series D | 03/01/2016 | Oak Investment Partners, tenave.com
Guavus (guavus.com; San Mateo, 2006) | $30M | Series F | 06/01/2015 | Investor Growth Capital, QuestM, Artiman, Sofinnova Ventures, Intel Capital
Platfora (platfora.com; San Mateo, 2011) | $30M | Series D | 12/01/2015 | Harmony Partners, Allegis Capital, Andreessen Horowitz, Battery, Citi Ventures, Sutter Hill Ventures, Tenaya Capital, Cisco Investments
Qubole (qubole.com; Mountain View, 2011) | $30M | Series C | 01/01/2016 | Institutional Venture Partners, Charles River VC, Norwest Venture Partners, Lightspeed Venture Partners
Maana Inc (maana.io; Palo Alto, 2012) | $26M | Series B | 05/01/2016 | Saudi Aramco Energy Ventures, Shell, GE Ventures, Chevron, Intel Capital, Frost Data Capital
Tamr (tamr.com; Cambridge, 2012) | $25M | Series B | 06/01/2015 | SineWave Ventures, New Enterprise Associates, Google Ventures, MassMutual Ventures, Thomson Reuters, Hewlett Packard Ventures



Funnel view of Sector
Stage | No. of Companies | Percent of Previous
Founded | 288 | -
Funded | 169 | 58.7%
Series A | 116 | 68.6%
Series B | 70 | 60.3%
Late | 36 | -
M&A & IPO | 41 | -
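
The "Percent of Previous" figures in the funnel can be checked with a few lines of arithmetic. The sketch below is illustrative only: the stage counts come from the funnel, while the `FUNNEL` list and the `percent_of_previous` helper are hypothetical names introduced here, and it assumes each percentage is simply the stage count divided by the count of the stage before it.

```python
# Stage counts as given in the funnel view (Founded -> Funded -> Series A -> Series B).
FUNNEL = [
    ("Founded", 288),
    ("Funded", 169),
    ("Series A", 116),
    ("Series B", 70),
]


def percent_of_previous(stages):
    """Return (stage, percent) pairs, each relative to the preceding stage."""
    results = []
    for (_, prev_count), (name, count) in zip(stages, stages[1:]):
        # Conversion rate from the previous stage, rounded to one decimal place.
        results.append((name, round(100.0 * count / prev_count, 1)))
    return results


for name, pct in percent_of_previous(FUNNEL):
    print(f"{name}: {pct}% of previous stage")
```

Under that assumption the computed rates agree with the 58.7%, 68.6%, and 60.3% shown in the funnel.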



Most Active Investors in Sector

[Figure: bar chart of the total number of companies backed by each of the most active investors (y-axis: Total Number of Companies).]

Investor | Example portfolio companies
NEA | Pentaho, DataRobot
Andreessen Horowitz | SnapLogic, Platfora
Accel | Trifacta, Zoomdata
Intel | Guavus, Endeca
Khosla | Ayasdi, ThoughtSpot
Madrona | Context Relevant, Dato
Redpoint | Looker, Datameer
A Ventures | DataSift, DataRobot
GE Ventures | Ayasdi, Maana Inc
Kleiner Perkins | OSIsoft, Ayasdi



Where are Top Investors investing
Investors (columns): New Enterprise Associates, Andreessen Horowitz, Accel Partners, Intel Capital, Khosla Ventures, MADRONA, Redpoint, A Ventures, GE Ventures, Kleiner Perkins

Portfolio companies by business model (rows):
Contextual Data Analytics: Guavus, Ayasdi, Context Relevant
Data Analytics: Looker, DataSift, Predixion Software, DataRobot, Schemalogic, eBureau
Data Preparation: Pentaho, SnapLogic, Trifacta, Tamr, Alation, Inc., Paxata
Verticals: Maana Inc, Uptake, FusionOps, Weft, Striim, ParStream, Seeq, Sight Machine, OSIsoft, BitStew
Data Discovery: Platfora, Datameer, Zoomdata, ClearStory Data, DataPad, Arcadia Data
Streaming Data Analytics: StreamBase
Data Visualization: Tableau, CartoDB



Top Investor by Stage of Entry
Seed | Series A | Series B | Later Stage
Techstars (3) | Andreessen Horowitz (6) | New Enterprise Associates (8) | Accel Partners (4)
Center for Innovative Technology (2) | New Enterprise Associates (5) | Accel Partners (5) | Goldman Sachs (4)
500 Startups (1) | Data Collective (4) | Intel Capital (5) | Kleiner Perkins (3)
A Ventures (1) | Lightspeed Venture Partners (4) | GE Ventures (4) | New Enterprise Associates (3)
A-Level Capital (1) | A Ventures (3) | Khosla Ventures (4) | Redpoint (3)
Acorn Innovestments (1) | Accel Partners (3) | A Ventures (3) | Sapphire Ventures (3)
AlphaLab (1) | Google Ventures (3) | Andreessen Horowitz (3) | Allegis Capital (2)
Andreessen Horowitz (1) | Intel Capital (3) | MADRONA (3) | Andreessen Horowitz (2)
Baltimore Angels (1) | Khosla Ventures (3) | AME Cloud Ventures (2) | BDC (2)
Blume Ventures (1) | MADRONA (3) | BDC (2) | Battery (2)



Major Acquisitions in last one year
Date | Company | Acquirer | Deal size | Overview | Total Funding
04/20/2016 | Lexmark (lexmark.com; Lexington, 1990) | apexmic.com | $4.0B | Enterprise Search | -
03/22/2016 | Sense (sense.io; California, 2012) | cloudera.com | - | Data Science Platform as a Service. Investors: Granite Ventures LLC, Illuminate | $1.1M
01/19/2016 | AimLogic (aimlogic.com; Las Vegas, 2014) | nccdirect.com | - | Predictive Analytics for lead scoring | -
01/16/2016 | Iris Analytics (iris-analytics.com; Frankfurt, 2000) | ibm.com | - | Real-time decisions for improved fraud and risk control | -
01/14/2016 | ListenLogic (listenlogic.com; San Jose, 2007) | marlinequity.com | - | Social Business Intelligence Analytics | -
11/18/2015 | Syncsort (syncsort.com; Woodcliff Lake, 1968) | clearlakecapital.com | - | Big Data Integration | -
10/26/2015 | ParStream (parstream.com; Cologne, 2008) | cisco.com | - | Analytics for IoT. Investors: CrunchFund, Khosla Ventures, Baker Capital, Tola Capital, Data Collective | $13.6M
08/7/2015 | Informatica (informatica.com; Redwood City, 1993) | permira.com | - | Data Integration Software | -



Big Data Analytics – Insights, Market Trends
• The Big Data market was estimated at $6.8 billion in 2012 and is projected to grow by almost 40% every year. [Source 1]
• The global Business Intelligence and Analytics Software market is expected to grow from $17.90B in 2014 to $26.78B by 2019, at a Compound Annual Growth Rate (CAGR) of 8.4%. [Source 2]
• Big Data Analytics attracted $2.26B in total funding in 2015/16.
• Funding for Contextual Data Analytics rose sharply from 2014 to 2015, reaching ~$1B in 2015. Palantir received the largest share.
• Demand for advanced analytics rose as analytics companies began deploying predictive models in IoT, Finance, and other real-world use cases. Applied Predictive Technologies was acquired by MasterCard for $600M.
• Data Science Platforms are gaining popularity as demand grows for the data models needed for Predictive Analytics.
• Self-service Data Discovery platforms help business users get insights from raw data through features such as easy query languages, natural language search, and visual analytics, reducing dependency on IT and analysts.
• Self-service Data Discovery platforms built on Hadoop and Apache Spark are expected to grow.

-Source 1 -Source 2
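
The CAGR quoted for the Business Intelligence and Analytics Software market can be sanity-checked from its two endpoints. This is a minimal sketch, assuming the standard compound-growth definition and a five-year span (2014 to 2019); the function names `cagr` and `project` are hypothetical helpers introduced here, not part of the report.

```python
def cagr(start_value, end_value, years):
    """Compound annual growth rate implied by two endpoint values."""
    return (end_value / start_value) ** (1.0 / years) - 1.0


def project(start_value, annual_rate, years):
    """Value reached after compounding at annual_rate for the given years."""
    return start_value * (1.0 + annual_rate) ** years


# Endpoints quoted in the market-trends bullets (in $ billions).
start_2014 = 17.90
end_2019 = 26.78
years = 2019 - 2014

implied_rate = cagr(start_2014, end_2019, years)
print(f"Implied CAGR: {implied_rate:.1%}")

# Reverse check: compound the 2014 figure at the quoted 8.4% rate.
print(f"Projected 2019 market: ${project(start_2014, 0.084, years):.2f}B")
```

The implied rate comes out close to the quoted 8.4%, and compounding the 2014 figure at 8.4% lands within a few hundredths of a billion of the 2019 estimate, so the three numbers in that bullet are mutually consistent.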



Tracxn BlueBox : Big Data Analytics June 2016
280+ companies in the sector, $5 B invested in last 5 years, $2.26 B invested in 2015/16
Most Active Investors: Accel Partners, Andreessen Horowitz, NEA, Intel Capital
Category amounts are cumulative funding in the sector.

BIG DATA ANALYTICS SUITE ($12.5M): Fusionex (2006, IPO), Microstrategy (1989, IPO), Qliktech (1993, IPO)
VERTICALS ($540M): OSIsoft (1980, $140M), Uptake (2014, $45M), FusionOps (2005, $45M), Space Time (2008, $42M)
DATA SCIENCE PLATFORM ($105M): Revolution (2007, $32M), Continuum (2011, $27.7M), Dato (2013, $25.25M)
DATA DISCOVERY ($353M): Platfora (2011, $95M), Datameer (2009, $77M)
STREAMING DATA ANALYTICS ($200M): Qubole (2011, $50M), StreamBase (2003, Acquired)
SEARCH BASED ANALYTICS ($173M): ThoughtSpot (2012, $91M)
CONTEXTUAL DATA ANALYTICS ($2.68B): Palantir (2004, $2B)
DATA ANALYTICS ($860M): Alteryx (2010, $163M), Looker (2011, $96M), Predictive Technologies (1999, Acquired)
DATA PREPARATION ($767M): Attivio (2007, $102M), Talend (2006, $102M), SnapLogic (2006, $96.3M)

Practice Area – Enterprise Apps | Analyst: @jaiswalmanish30



Top Business Models by Funding

Business Model | Total Funding | Avg. Age (average founding year) | # Cos. | Example companies
Contextual Data Analytics | $2.7B | 2009.8 | 17 | Palantir, Guavus
Data Analytics | $860M | 2008.3 | 77 | Alteryx, Predictive Technologies
Data Preparation | $767M | 2005.7 | 35 | Attivio, Talend
Verticals | $540M | 2008.4 | 80 | OSIsoft, Uptake
Data Discovery | $354M | 2009.7 | 17 | Platfora, Datameer
Streaming Data Analytics | $200M | 2008.7 | 22 | Qubole, StreamBase
Data Visualization | $187M | 2008.8 | 6 | Birst Inc., CartoDB
Search Based Analytics | $173M | 2006.2 | 5 | ThoughtSpot, Endeca
Data Science Platform | $103M | 2010.2 | 16 | Revolution Analytics, Continuum
Big Data Analytic Suites | $13M | 2003.6 | 7 | QlikTech, Ingensi



Contextual Data Analytics – Business Model
Description
Companies that provide Data Analytics Platforms for industry-specific data models.

Increasing trend towards Machine Intelligence Platforms:

• Companies that provide machine intelligence platforms received ~$1B in funding in 2015.
• Palantir, a data analytics platform used by government agencies and law enforcement, raised $880M at a $20B valuation in 2015.
• Guavus bolstered its offerings for Communication Service Providers, cable, and media by raising $30M in 2015.
• Companies are moving towards providing machine intelligence platforms for IoT.
• Ayasdi raised $55M in a Series C round to expand its machine intelligence business.



Contextual Data Analytics – Entrepreneur Activity and Investment Trend
[Figure: three panels. (1) YoY – No. of Companies Founded, 2008 to 2015. (2) Distribution of Companies across Countries: United States, India, Australia, Canada, Italy. (3) YoY – Total Funding and No. of Rounds, 2008 to 2015 (funding amount in $ millions).]



Contextual Data Analytics – Most Funded Companies
Company | Overview | Business Model | Funding Amount
Palantir (palantir.com; 2004, Palo Alto) | Intelligence products to Augment Human Driven Analysis. Investors: Ulu Ventures, Founders Fund, Morgan Stanley, Glynn Capital Management, PENSCO, ARTIS Ventures, YVentures, sfsentry.com, Reed elsevier ventures, In-Q-Tel | Contextual Data Analytics | $2.32B
Guavus (guavus.com; 2006, San Mateo) | Big data analytics for CSPs. Investors: Artiman, Goldman Sachs, QuestM, Singtel Innov8, Translink Capital, Investor, Intel Capital, Investor Growth Capital, Sofinnova Ventures, Artiman | Contextual Data Analytics | $137M
Ayasdi (ayasdi.com; 2008, Palo Alto) | Machine Learning Intelligence Platform for Industries. Investors: ventures.citi.com, Institutional Venture Partners, Kleiner Perkins, Khosla Ventures, Centerview Capital, Draper Nexus, Floodgate, GE Ventures | Contextual Data Analytics | $108.13M
Context Relevant (contextrelevant.com; 2012, Seattle) | Provider of Big Data analytics modeling and analytics software. Investors: Vulcan Capital, work bench, new york life insurance company, Bloomberg Beta, Goldman Sachs, Bank of America, Formation, MADRONA, rolling bay ventures | Contextual Data Analytics | $44.3M
Interana (interana.com; 2012, Menlo Park) | Behavioural Analytics for event Data. Investors: Y Combinator, SV Angel, Fuel Capital, Data Collective, Battery, Index Ventures, AME Cloud Ventures | Contextual Data Analytics | $28.2M
Infobright (infobright.com; 2005, Toronto) | Data Analytics Platform For IoT. Investors: Sun Microsystems, Flybridge, RBC | Contextual Data Analytics | $21M
Spire (spire2grow.com; 2008, Bangalore) | Recruitment analytics using contextual data | Contextual Data Analytics | $8M



Data Analytics – Business Model Description
Data models for predictive analytics, forecasting and projections etc.

Advanced Analytics: Companies that provide predictive analytics platform, machine learning, data mining etc.
Text Analytics: Companies that provide content intelligence platform for text analytics and text mining solutions.
Social Media Data Analytics: Companies that provide platform for real-time and historical data analysis of public social conversations such as Twitter and Facebook.



Data Analytics – Entrepreneur Activity and Investment Trend
[Figure: four panels. (1) YoY – No. of Companies Founded, 2008 to 2015. (2) Distribution of Companies across Countries: United States, India, Canada, United Kingdom, Israel, Switzerland, Argentina, France. (3) YoY – Total Funding and No. of Rounds, 2008 to 2015 (funding amount in $ millions). (4) BM wise Investment (funding amount in $ millions): Advanced Analytics, Social Media Data Analytics, Text Analytics.]



Data Analytics – Most Funded Companies
Company | Overview | Business Model | Funding Amount
Alteryx (alteryx.com; 2010, Irvine) | SaaS Analytics Business Suite. Investors: Iconiq Capital, Sapphire Ventures, Meritech, Toba Capital, Insight Partners | Data Analytics - Advanced Analytics | $163M
Predictive Technologies (predictivetechnologies.com; 1999, Arlington) | Cloud based cause effect analytics software. Investors: Accel-KKR, Goldman Sachs | Data Analytics - Advanced Analytics | $154M
Looker (looker.com; 2011, Santa Cruz) | BI: Data Exploration and discovery. Investors: Kleiner Perkins, First Round, Sapphire Ventures, Redpoint, Meritech, PivotNorth Capital | Data Analytics | $96M
Blue Yonder (blue-yonder.com; 2008, Karlsruhe) | Predictive Analytics based on Artificial Intelligence. Investors: Warburg Pincus | Data Analytics - Advanced Analytics | $75M
DataSift (datasift.com; 2010, Reading) | Social Data Analytics Platform. Investors: Northgate, Daher Capital, Scale Venture Partners, Cendana Capital, A Ventures, Insight Partners, Upfront | Data Analytics - Social Media Data Analytics | $64.2M
DataRobot (datarobot.com; 2012, Boston) | Machine Learning Platform for Predictive Analysis. Investors: New Enterprise Associates, Accomplice, Techstars, Recruit Strategic Partners, New York Life Insurance Company, Atlas Venture, Intel Capital, A Ventures | Data Analytics - Advanced Analytics | $57.42M
RapidMiner (rapidminer.com; 2006, Cambridge) | Predictive analytics on open stack. Investors: Longworth, Ascent Venture Partners, Earlybird, Open Ocean Capital, Nokia Growth Partners | Data Analytics - Advanced Analytics | $36M
Predixion Software (predixionsoftware.com; 2009, San Juan Capistrano) | Predictive intelligence solutions. Investors: Palomar Ventures, Frost Data Capital, EMC, DFJ Frontier, Miramar Venture Partners, Accenture, Software AG, GE Ventures | Data Analytics - Advanced Analytics | $35.8M



Data Preparation – Business Model Description
Data integration, transformation, cleaning and loading before visualization process.

Enterprise Search: Companies that provide textual search based analytics platform for visualization.
Data Integration: Companies that provide data integration platform to integrate data from multiple sources.
Data Cataloging: Companies that provide data cataloging platform to classify structured, semi-structured and unstructured data for analytics.



Data Preparation – Entrepreneur Activity and Investment Trend
[Figure: four panels. (1) YoY – No. of Companies Founded, 2008 to 2015. (2) Distribution of Companies across Countries: United States, Canada, India, Austria, France, Germany, Israel, Italy. (3) YoY – Total Funding and No. of Rounds, 2008 to 2015 (funding amount in $ millions). (4) BM wise Investment (funding amount in $ millions): Data Integration, Enterprise Search, Data Cataloging.]



Data Preparation – Most Funded Companies
Company | Overview | Business Model | Funding Amount
Attivio (attivio.com; 2007, Newton) | Data discovery, integration and enterprise search. Investors: tenave.com, Oak Investment Partners | Data Preparation - Data Cataloging | $102M
Talend (talend.com; 2006, Redwood City) | Open source integration software data management tools. Investors: Chausson Finance, Iris Capital, Balderton, Galileo Partners, Idinvest Partners, Silver Lake, Bpifrance | Data Preparation - Data Integration | $102M
SnapLogic (snaplogic.com; 2006, San Mateo) | iPaaS for enterprise applications. Investors: Ignition Partners, H. Barton Asset Management, Microsoft, Andreessen Horowitz, Maples Investments, Dhillon Capital, Silver Lake, Triangle Peak, Pharus Capital Management | Data Preparation - Data Integration | $96.3M
Trifacta (trifacta.com; 2012, San Francisco) | Making data in Hadoop easy for analytics. Investors: Ignition Partners, Accel Partners, Greylock, Infosys, XSeed Capital, Cathay, Data Collective | Data Preparation | $76.3M
Pentaho (pentaho.com; 2004, Orlando) | Data integration company. Investors: New Enterprise Associates, Benchmark, Index Ventures, dag ventures | Data Preparation - Data Integration | $48M
Coveo (coveo.com; 2005, Quebec City) | Knowledgebase management through big data analytics and enterprise search. Investors: Access Capital, Telesystem, Tandem, iqventureadvisors.com, BDC, Fonds de solidarité FTQ | Data Preparation - Enterprise Search | $69.7M
Lucidworks (lucidworks.com; 2007, Redwood City) | Search platform based on Apache Lucene and Solr Technologies. Investors: Granite Ventures LLC, In, Walden International, Shasta Ventures, Allegis Capital | Data Preparation - Enterprise Search | $61M



Verticals – Business Model Description
Data Analytics Platform for Financial Institutions for Credit Risk Assessment, Trade Analytics etc.

Heavy Industries: Companies that provide platform for real-time data analytics on machine data and IoT data in heavy industries.
Logistics and Transportation: Companies that provide analytics platform to transport and logistics for real-time supply-chain optimization, logistics operations, freight management etc.



Verticals – Entrepreneur Activity and Investment Trend
[Figure: four panels. (1) YoY – No. of Companies Founded, 2008 to 2015. (2) Distribution of Companies across Countries: United States, India, United Kingdom, Canada, France, China, Germany. (3) YoY – Total Funding and No. of Rounds, 2008 to 2015 (funding amount in $ millions). (4) BM wise Investment (funding amount in $ millions): Heavy Industries, Logistics and Transportation, Supply Chain, Finance.]



Verticals – Most Funded Companies
Company | Overview | Business Model | Funding Amount
OSIsoft (osisoft.com; 1980, San Leandro) | Developer of technology to capture, process, analyze and store sensor data. Investors: Kleiner Perkins, Technology Crossover Ventures | Verticals - Heavy Industries - IoT | $140M
Uptake (uptake.com; 2014, Palo Alto) | Domain specific Data Analytics. Investors: New Enterprise Associates, Lightbank, Caterpillar, General Purpose Vehicles | Verticals - Heavy Industries | $45M
FusionOps (fusionops.com; 2005, Sunnyvale) | Cloud based BI for supply chain. Investors: Georgian Partners, New Enterprise Associates, Sierra Ventures | Verticals - Supply Chain | $44.6M
Space Time Insight (spacetimeinsight.com; 2008, San Mateo) | Real time analytics for data from varied sources. Investors: Opus Capital, Start Up Farms International, Zouk Capital, Novus Energy Partners, EnerTech Capital, NEC Corporation, Informatica, E.on, Opus Capital | Verticals - Heavy Industries - IoT | $42M
Maana Inc (maana.io; 2012, Palo Alto) | Search Engine for Big Data based Applications. Investors: Frost Data Capital, Shell, Intel Capital, ConocoPhillips, Chevron, GE Ventures, Saudi Aramco Energy Ventures | Verticals - Heavy Industries | $40.15M
Striim (striim.com; 2012, Palo Alto) | Streaming data pipelines and analysis. Investors: Summit Partners, Intel Capital, Panorama Point | Verticals - Heavy Industries | $31M
BBD (bbdservice.com; 2013, Chengdu) | Big Data Solution for the Finance Industry. Investors: CDH Investments, sanshenggroup.net, Goldman Sachs | Verticals - Finance | $30.65M
Sight Machine (sightmachine.com; 2012, San Francisco) | Flexibility and insight for lean manufacturing. Investors: eLab Ventures, Jump Capital, Mercury Fund, Huron River Ventures, Orfin Ventures, O'Reilly AlphaTech Ventures, Pritzker Group, A Ventures, Two Roads Group, FundersClub, GE Ventures | Verticals - Heavy Industries | $24.5M



Data Discovery – Business Model Description
Self-service data discovery. Companies that provide data discovery platforms for visualizing raw data by correlating different data sets.

Trends:
• Steady growth in funding since 2011.
• Platfora, built natively on Apache Hadoop and Spark, launched Platfora 5.2 in 2016, which enables “citizen data scientists” to do self-service data preparation, visual analysis, and behavioral analytics.
• Datameer, built on Hadoop, launched Datameer 6 in 2016, adding Apache Spark for real-time processing in Big Data Discovery.
• Zoomdata launched Zoomdata 2.2 in 2016, a tool that interacts with Apache Spark using a microservices architecture.



Data Discovery – Entrepreneur Activity and Investment Trend
[Figure: three panels. (1) YoY – No. of Companies Founded, 2008 to 2015. (2) Distribution of Companies across Countries: United States, India, Japan, Netherlands, Switzerland. (3) YoY – Total Funding and No. of Rounds, 2008 to 2015 (funding amount in $ millions).]



Data Discovery – Most Funded Companies
Company | Overview | Business Model | Funding Amount
Platfora (platfora.com; 2011, San Mateo) | Self service Big Data Analytics. Investors: Sutter Hill Ventures, Cisco Investments, Citi Ventures, Andreessen Horowitz, In-Q-Tel, Battery, Tenaya Capital, Allegis Capital, Harmony Partners | Data Discovery | $95M
Datameer (datameer.com; 2009, San Mateo) | Big Data Analytics Platform on Hadoop. Investors: Kleiner Perkins, Next World Capital, Citi Ventures, Redpoint, Software AG, Workday, Top Tier Capital Partners, sttelemedia.com | Data Discovery | $76.8M
Logi Analytics (logianalytics.com; 2000, McLean) | BI analytics solutions. Investors: Grotech, Summit Partners, Updata, LLR Partners | Data Discovery | $48M
Zoomdata (zoomdata.com; 2012, Reston) | Visual Analytics for big data. Investors: Accel Partners, New Enterprise Associates, Comcast Ventures, Columbus Nova Technology Partners, Razor, CIT, 7 Inc, Goldman Sachs | Data Discovery | $47.2M
1010data (1010data.com; 2000, New York City) | Cloud - Big Data Analytics. Investors: Norwest Venture Partners | Data Discovery | $35M
ClearStory Data (clearstorydata.com; 2011, Palo Alto) | Analyze data from internal and external sources. Investors: Google Ventures, Kleiner Perkins, Khosla Ventures, Andreessen Horowitz, dag ventures | Data Discovery | $30M
Arcadia Data (arcadiadata.com; 2012, San Mateo) | BigData analytics. Investors: Mayfield, Blumberg Capital, Intel Capital | Data Discovery | $12.68M
Smart Insight (smartinsight.io; 2013, Tokyo) | Data Discovery graph search. Investors: INCJ | Data Discovery | $4M



Big Data Analytics – Company List

DETAILS CLOSE TO 288 COMPANIES


(169 FUNDED AND 119 UNFUNDED COMPANIES)

COVERS THE FOLLOWING SECTORS

Contextual Data Analytics, Data Analytics, Data Discovery, Data Preparation, Data Science Platform, Data Visualization, Search Based Analytics, Streaming Data Analytics, Verticals, Services, Big Data Analytics Suite


Contextual Data Analytics (1/5)


Palantir [Palo Alto, 2004]: Palantir's platforms provide data analysis capabilities to enterprises through its products Gotham and Metropolis. Gotham integrates structured and unstructured data and enables discovery, analysis, and knowledge management. Metropolis conducts quantitative data analysis, data integration, and information management. Palantir's products are extensively used by U.S. federal intelligence agencies, state governments, local governments, and financial institutions around the world. Some of the solutions include case management, anti-fraud, disaster preparedness, law enforcement, defense, and insurance analytics. Acquisitions: Kimono labs in February 2016; FT Technologies in February 2015; Propeller in July 2014; Poptip in July 2014; Voicegem in February 2013.
Funding: $2.32B
Investors: Ulu Ventures, Founders Fund, Morgan Stanley, Glynn Capital Management, PENSCO, ARTIS Ventures, YVentures, sfsentry.com, Reed elsevier ventures, In-Q-Tel

Guavus [San Mateo, 2006]: Guavus provides Big Data analytics solutions, which enable intelligent decision-making for network operations, marketing, customer care and monetization. The company’s data analysis offering is used specifically by telecommunications providers to gain insights on network performance, device and application usage, content and subscriber behavior in order to optimize network capacity and increase revenues. The company counts 4 of the top 5 mobile network operators, 3 of the top 5 Internet backbone providers, as well as 80% of cable MSOs in North America as customers. It currently claims to analyze more than 50% of all US mobile data traffic and to process more than 2.5 petabytes of data per day. Deployments span retail, enterprise and wholesale channels across data, wireless, wire-line and multimedia product offerings.
Funding: $137M
Investors: Artiman, Goldman Sachs, QuestM, Singtel Innov8, Translink Capital, Investor, Intel Capital, Investor Growth Capital, Sofinnova Ventures, Artiman

Ayasdi [Palo Alto, 2008]: Ayasdi automates and accelerates insight discovery. The company’s Machine Intelligence software employs Topological Data Analysis (TDA) to simplify the extraction of knowledge from the most complex data sets confronting organizations today. Developed by Stanford computational mathematicians, Ayasdi’s approach combines machine-learning algorithms, abundant compute power and topological summaries to revolutionize the process for converting data into business impact.
Funding: $108.13M
Investors: ventures.citi.com, Institutional Venture Partners, Kleiner Perkins, Khosla Ventures, Centerview Capital, Draper Nexus, Floodgate, GE Ventures

Context Relevant [Seattle, 2012]: Context Relevant is a leader in Big Data analytics. Context Relevant has developed an analytics system that ingests data, determines what’s important, and creates a problem-specific solution that can be deployed while continually learning. The proprietary high-performance machine learning technology accelerates analytics and actionable insight. It is backed by some of the major companies in the Financial Services domain.
Funding: $44.3M
Investors: Vulcan Capital, work bench, new york life insurance company, Bloomberg Beta, Goldman Sachs, Bank of America, Formation, MADRONA, rolling bay ventures


Contextual Data Analytics (2/5)


Company Details Funding Investors
Interana [Menlo Park, 2012]: Interana is a fast and scalable event-based analytics solution that answers critical
business questions on how customers behave and products are used. It focuses on the key business metrics that matter
most in a data-driven world, such as growth, retention, conversion and engagement, and allows customers to
discover and investigate these insights easily through its visual and interactive interface. Streaming data and data
at rest are analyzed together, enabling companies to see how real-time snapshots fit into historical trends. It analyzes
massive volumes of call detail records to understand how users behave and interact with products and services.
Sony, Tinder, Jive, Bloomboard and Orange are among its customers.
Funding: $28.2M. Investors: Y Combinator, SV Angel, Fuel Capital, Data Collective, Battery, Index Ventures,
AME Cloud Ventures.

Infobright [Toronto, 2005]: Infobright develops an analytic database designed for applications and data marts that
analyze large volumes of machine-generated data such as web data, network logs, telecom records, stock tick data
and sensor data. The solution is capable of storing and analyzing sensor data at IoT scale. Common use cases are for
enterprises, SaaS and software companies in online businesses, telecommunications, financial services and other
industries to provide rapid analysis of critical business data.
Funding: $21M. Investors: Sun Microsystems, Flybridge, RBC.

Spire [Bangalore, 2008]: Spire offers big-data-based enterprise technology with solutions for supply
chain management, customer relationship management, fraud intelligence, talent growth management and predictive
talent intelligence. Its technology solutions enable enterprises to make informed decisions with accurate
demand-supply mapping from any combination of structured and unstructured data. It also provides fraud detection
solutions by pulling in data from unstructured sources such as documents, emails, SMS and reviews. The company
has previously raised $1 million in a seed round from several angel investors.
Funding: $8M.

Market6 [Ohio City, 2006]: Market6 is a big data analytics company that leverages retailers' operational data to
improve overall business performance and enable better collaboration with suppliers. Its flagship product is
DemandView®, a suite of tools, reporting, and predictive analytics provided via Information Services or SaaS
solutions that provides a real-time, forward-looking view of sales, promotion and distribution performance.
DemandView is used by the largest supermarket chain in the US, and is accessed by over 400 supplier partners.
Funding: $5.5M. Investors: Sevin Rosen Funds.


Contextual Data Analytics (3/5)


Company Details Funding Investors
dMetrics [Brooklyn, 2009]: dMetrics is building the definitive map of consumer decision-making. It operates on an
NLP-based engine, and its team of MIT PhDs has built a platform to analyze thousands of social networks,
blogs, and forums, identifying and connecting people, their decisions about products, and the rationale for these
decisions. The result is a dynamic interest graph, helping companies and individuals make better decisions about
products to use, campaigns to run, and companies to invest in. Its clients (from Fortune 500 to non-profits) use
dMetrics to get the most accurate, unbiased, and transparent view of people’s actions, tracing decision making all the
way from high-level market insights to high-value discussions. Understanding what people do, and why, is the next
frontier of text analytics. Clicks, Likes and Retweets do not representatively convey the voice of the consumer,
comprising a tiny portion of the information contributed by Internet users. Most of the Internet (other than photos of
kittens) is conversational: blog posts, forum discussions, Twitter dialogues. It claims to have a platform that extracts
consumer decisions from billions of online conversations. So far, its platform has served clients in healthcare, which
has resulted in a database of healthcare-centric insights about 14,000+ products. Similar repositories of consumer
insights for CPG, telecom, and banking industries are underway.
Funding: $2.3M. Investors: National Science Foundation.

deepsense.io [Menlo Park, 2014]: deepsense.io's product SeaHorse enables users to build Spark applications using an
intuitive visual environment without the need to write any code. By dragging and dropping blocks from the palette of
available operations, users are able to design any data workflow that contains steps such as ETL, data manipulation,
clean-up and reports, up to advanced predictive modeling. It has an open-source community edition, while the
enterprise edition offers some more features.
Funding: $2M. Investors: CodiLime.

Aureus Analytics [Mumbai, 2013]: Provider of predictive analytics and big-data-ready platforms for insurance
companies and banks. Its platform, ASAP, allows users to create, publish and execute analytical models. Focused on
Europe and the South East Asian market. Customers include Bharti AXA Life Insurance, Aegon Religare Life Insurance
and General Insurance Council of India, among others. Headquartered in Singapore with R&D in Mumbai. Secured
angel funding of $850k from a group of individual investors through LetsVenture.
Funding: $850k.


Contextual Data Analytics (4/5)


Company Details Funding Investors
Innovaccer [Noida, 2012]: InnovAccer offers tools for researchers to collect, structure, and connect data. It collects
data through its partnerships with organizations, by licensing private data sets and by scouring the web. Its product
managers and data scientists then validate the new information, upload it into its platform, and connect this data
to new datasets. Provides Big Data Analytics, Data Mining, Data Standardization and Visualization, Predictive Analytics,
Operations Research and Statistical Analysis solutions. Also offers Datashop, a cloud-based bank of customizable research
data and research infrastructure. Collaborated with researchers from 160+ universities including Harvard, Wharton,
Stanford, MIT, INSEAD, NYU, UC Berkeley, etc. Raised seed funding in May 2015 in a round led by Rajan Anandan,
Google's vice president for South East Asia, with other investors being 500 Startups; redBus founder Phanindra Sama;
Teru Sato, chief executive of Beenos Group; Aneesh Reddy, founder and CEO of Capillary Technologies; and
Venkatesh Valluri, chairman of Ingersoll Rand India.
Investors: 500 Startups.

DtoK Lab S.r.l. - Scalable Data Analytics [Rende, 2014]: DtoK Lab develops a SaaS system that analyzes large
amounts of data on its platform, DtoK Lab. It uses a scalable algorithm for parallelizing data analysis applications
modeled as complex workflows. The algorithm efficiently exploits the vast storage and computing capacity of
Cloud systems. The system can support ad hoc data analysis solutions on both public and private Clouds.

Entrigna [Hoffman Estates, 2011]: Entrigna's RTES provides a real-time decision-making platform that features complex
event processing, machine learning, optimization and a rules engine, along with Big Data and Internet of Things data
processing. It has developed over 20 statistical algorithmic techniques for 5 core capabilities: prediction of future
events, business rule engine, optimization, classification/clustering and recommendation engine. Its business
applications include marketing offer optimization, customer churn prediction, inventory/yield
management, omni-channel delivery and customer service recovery.

Symberra [Canberra, 2015]: Symberra is a cloud-based data analytic platform that specialises in large-scale data
modelling and simulations. Optimised data storage and simulation algorithms enable the domain experts to gain in-
depth insights from big data in a fraction of the time required today.


Contextual Data Analytics (5/5)


Company Details Funding Investors

Penser Analytics [Bangalore, 2014]: Penser Analytics offers a real-time contextual management information system.
Its real-time analytics suite provides an understanding of the business through contextual reports, enabling clients
to explore various aspects of the business in real time. It claims to integrate with the client's CRM solution to act on
various observations. It offers solutions for areas such as sales, marketing, healthcare, HR and eCommerce.

Sciera [Atlanta, 2004]: Sciera offers big data solutions that allow enterprises to make data digestible and usable for
micro-targeting and competitive differentiation. It offers three products, namely Real Watch, Social Watch and
Competitive Intelligence. Real Watch allows companies to leverage home status to correctly position their products and
increase sales. Social Watch is a text analytics engine that allows companies to track social media, gain insights and
manage their engagement with customers. Also offers strategy consulting and analytics research services.


Data Analytics (1/19)


Company Details Funding Investors
Looker [Santa Cruz, 2011]: Looker provides BI analysis and visualization solutions for enterprises. It is a web-based
business intelligence platform that has developed a new data description language called LookML. With LookML,
analysts can create and curate custom data experiences so any employee can explore and utilize the data of relevance
to them. Looker integrates with a number of the more popular modern databases, including Amazon Redshift,
Teradata Aster, HP Vertica, Greenplum, Impala, BigQuery and Spark. Looker is being used by customers such as Yahoo!,
Warby Parker, Asana, Instacart, Docker, Venmo, Upworthy and Gilt. It claims 400% annual growth and a customer
list of 250 individual organizations.
Funding: $96M. Investors: Kleiner Perkins, First Round, Sapphire Ventures, Redpoint, Meritech, PivotNorth Capital.

PoweredAnalytics [Pittsburgh, 2012]: PoweredAnalytics is a cloud-based predictive-analytics-as-a-service platform
that delivers actionable data-driven insight for customer segmentation. It was spun out of the Pittsburgh startup
accelerator AlphaLab.
Investors: AlphaLab.

Koverse [Seattle, 2012]: Koverse provides a demand-driven platform for big data that enables users to run advanced
analytics against various data sources and develop results-driven applications. The platform makes
existing data perform on demand, consolidates data silos and provides built-in analytical infrastructure with the
intelligence to handle changing requirements. It offers solutions for various on-demand needs,
including threat intelligence, data set integration, real-time situational awareness, combination of real-time and
historical data, customer insight, stovepipe elimination, acquisition of data in its existing state, analytical
application development and rationing elimination. In February 2016, the company received a strategic investment
from Credit Suisse Asset Management.
Investors: Credit Suisse.

Return Logic [Carlisle, 2014]: Return Logic provides a cloud-based return management platform that enables retailers
to manage and optimize their product returns strategy. The software provides return analytics through a two-step
process: a physical returns process in which a percentage of returned product is intercepted and inspected by a trained
technician, and analysis of returns data to rapidly detect trends and identify specific issues with individual SKUs.


Data Analytics (2/19)


Company Details Funding Investors

Cloud Theta [Noida, 2013]: Cloud Theta is building a cloud-based platform that provides analytics-as-a-service to
customers in real time. It aims to provide data-driven decision-making insights to social, ecommerce, CRM and digital
media companies. Yet to launch as of May 2016.

Zerrabyte [Bangalore, 2014]: Zerrabyte is a stealth mode VC funded startup with the team distributed across US and
India. The company is working on a big data analytics platform for enterprises and various other open source projects.

Alteryx [Irvine, 2010]: Alteryx offers an analytics business suite for finance, operations, sales, communications,
retail and marketing, used in restaurants, real estate, healthcare, financial services and hospitality. Partners:
Tableau, HortonWorks, Qlik, Revolution Analytics, Teradata and Cloudera. Key customers: Experian Marketing Services,
Ford, McDonald’s, Sprint and Wal-Mart. Funded by SAP Ventures, Thomson-Reuters, and Toba Capital, Alteryx serves
500+ customers and 200,000+ users worldwide.
Funding: $163M. Investors: Iconiq Capital, Sapphire Ventures, Meritech, Toba Capital, Insight Partners.

Predictive Technologies [Arlington, 1999]: APT is a cloud-based analytics software company that enables
organisations to measure cause-and-effect relationships between business initiatives and outcomes to generate
economic value. APT’s Test & Learn for Sites, Test & Learn for Customers, Test & Learn for Ads, and other similar
products employ patented algorithms and workflow to design and interpret business experiments that evaluate,
target, and refine proposed business programs. It also offers products that support decision-making for specific
business needs including transaction analysis, space planning, promotion design, category management and location
selection. Walmart, Starbucks, Coca-Cola, Victoria’s Secret, American Family, Hilton Hotels, SUBWAY, TD Bank,
T-Mobile, and others are among its customers.
Funding: $154M. Investors: Accel-KKR, Goldman Sachs.


Data Analytics (3/19)


Company Details Funding Investors
Blue Yonder [Karlsruhe, 2008]: Blue Yonder provides predictive analytics using Big Data analytics. It leverages
machine learning techniques to generate insights by integrating data sources and running analytics. The company
provides its clients with a SaaS-based platform that can be used in demand planning, routing optimization,
scheduling, customer analysis and dynamic pricing. It was a Gartner Cool Vendor in 2015. Its
customers include Coca Cola, Bosch, OTTO and Deutsche Bahn AG, among others.
Funding: $75M. Investors: Warburg Pincus.

DataRobot [Boston, 2012]: DataRobot is a machine learning platform that leverages the cloud to generate
predictive models, which have long played a key role in industries such as health care, sales and marketing, and
finance. The platform is still in beta. It narrows down the search universe based on the characteristics of the training
data set and prediction target, and executes only the most relevant end-to-end procedures for fitting a model (called
Modeling Blueprints) to deliver the best predictive model in the fastest time possible. DataRobot uses cloud
computing to cost-effectively evaluate thousands of Modeling Blueprints in parallel. It then systematically applies a
cross-validation framework to accurately compare the performance of even the most diverse modeling techniques.
Funding: $57.42M. Investors: New Enterprise Associates, Accomplice, Techstars, Recruit Strategic Partners,
New York Life Insurance Company, Atlas Venture, Intel Capital, A Ventures.
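
The idea of comparing very different modeling techniques through cross-validation can be illustrated with a minimal
sketch. This is not DataRobot's implementation: the synthetic data, the two candidate models and the function names
(fit_mean, fit_line, k_fold_mse) are assumptions made purely for illustration, and the candidates are scored one
after another rather than in parallel. Each candidate is scored by its mean squared error averaged over k held-out
folds, so both are judged on data they were not fitted to.

```python
from statistics import mean

def fit_mean(xs, ys):
    """Baseline candidate: always predict the mean of the training targets."""
    m = mean(ys)
    return lambda x: m

def fit_line(xs, ys):
    """Second candidate: ordinary least-squares line y = a + b*x."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    a = my - b * mx
    return lambda x: a + b * x

def k_fold_mse(fit, xs, ys, k=5):
    """Average held-out mean squared error over k folds."""
    n = len(xs)
    fold_errors = []
    for fold in range(k):
        test_idx = set(range(fold, n, k))  # every k-th point is held out
        train_x = [xs[i] for i in range(n) if i not in test_idx]
        train_y = [ys[i] for i in range(n) if i not in test_idx]
        model = fit(train_x, train_y)
        fold_errors.append(mean((model(xs[i]) - ys[i]) ** 2 for i in test_idx))
    return mean(fold_errors)

# Hypothetical data: a linear trend with a small deterministic wobble.
xs = [float(i) for i in range(20)]
ys = [2.0 * x + 1.0 + (0.5 if i % 2 else -0.5) for i, x in enumerate(xs)]

# Score every candidate with the same folds, then keep the lowest error.
scores = {name: k_fold_mse(fit, xs, ys)
          for name, fit in [("mean", fit_mean), ("line", fit_line)]}
best = min(scores, key=scores.get)
print(best, scores)
```

Because every candidate is evaluated on the same folds, the scores are directly comparable even when the candidates
have nothing else in common.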
RapidMiner [Cambridge, 2006]: RapidMiner is an integrated environment for machine learning, data mining, text
mining, predictive analytics and business analytics. It is used for business, industrial education, rapid prototyping
and application development, and supports all steps of the data mining process including results visualization,
validation and optimization. RapidMiner enables customers to create a data-analytics workflow and grab data from a
variety of sources without scripting using built-in connectors, then process that data to see patterns that might
require action. It acquired Radoop, a big data analytics company, to extend predictive analytics to Hadoop.
Funding: $36M. Investors: Longworth, Ascent Venture Partners, Earlybird, Open Ocean Capital, Nokia Growth Partners.
Predixion Software [San Juan Capistrano, 2009]: Predixion Software develops and markets predictive analytics
solutions fully integrated with Microsoft’s BI platform. Predixion enables self-service predictive analytics, allowing
customers to use and analyze large amounts of data to make actionable decisions within the familiar environment of
Excel and PowerPivot. It launched its cloud-based predictive analytics platform in 2010 for enabling real-time
predictive analytics from the Internet of Things (IoT). It reported average revenue growth of over 800% for 2011 and
2012, as well as for Q1 2013. Predixion was selected as a finalist for Red Herring's Top 100 North America award.
Funding: $35.8M. Investors: Palomar Ventures, Frost Data Capital, EMC, DFJ Frontier, Miramar Venture Partners,
Accenture, Software AG, GE Ventures.


Data Analytics (4/19)


Company Details Funding Investors
eBureau [St. Cloud, 2004]: eBureau helps consumer-facing businesses find their next customer more cost effectively
through predictive analytics. With its next-generation predictive analytics platform, eBureau delivers insights to
online marketers, financial services companies and agencies. The company’s suite of lead quality scoring, audience
targeting, contact center optimization, and risk management solutions provides outstanding results for clients seeking
to optimize performance throughout their customer life cycle. Its patented system architecture combines proprietary
database management software and vast amounts of predictive content in order to seamlessly integrate hundreds of
billions of records across thousands of databases, covering nearly all U.S. adults and households. Its embedded
statistical modeling technology is used to develop eScores for its clients.
Funding: $30M. Investors: Horizon Technology Finance Management LLC, Redpoint, Split Rock, Tenaya Capital.
Signals [Netanya, 2009]: Signals provides decision analytics based on open source intelligence for better, faster
decision-making. The solution combines the best of man and machine: superior automated technology
for targeted big data collection and analysis, and teams of intelligence analysts and subject matter experts for insight
delivery and verification. Insights are delivered via a dynamic cloud-based intelligence dashboard, monitoring and
alerting services, and customized intelligence reports. Signals has supported C-suite decision-makers in world-leading
companies in varied industries (life sciences, consumer product goods, ICT, financial services, energy, agriculture, and
others) to make countless critical decisions related to new product development, competitive intelligence, market
intelligence, regulation and exploration of new spaces of growth. Signals was formerly known as d&a Visual Insights.
Funding: $25M. Investors: Qumra Capital, TPY Capital, Sequoia Capital.
Alpine Data Labs [San Francisco, 2011]: Alpine Data Labs is the developer of the advanced analytics platform Chorus,
which works with Apache Hadoop. Its flagship product, Alpine Chorus, is a collaborative, code-free solution for advanced
analytics on Big Data and Hadoop. Chorus provides a platform for data scientists, IT and business management to
share and collaborate on analytics projects. BOSCH, BlackBerry, EMC2, Sony and GE Capital are some of its customers.
In February 2014, Alpine Data Labs was added to the Gartner Magic Quadrant as a "Niche Player". In March 2014,
Alpine Data Labs was certified by Databricks on Apache Spark.
Funding: $23.5M. Investors: Robert Bosch Venture Capital, UMC Capital, Mission Ventures, Stanford University,
Sierra Ventures, Sumitomo Corporation Equity Asia.

Skytree [San Jose, 2012]: Skytree®, The Machine Learning Company®, provides an advanced enterprise-grade
Machine Learning platform that gives organizations the power to discover deep analytic insights, predict future
trends, make recommendations and reveal untapped markets and customers. Skytree’s flagship product, Skytree
Server, is a general-purpose scalable Machine Learning system built for accuracy at unprecedented
speed and scale.
Funding: $20.5M. Investors: Javelin Venture Partners, United Parcel Service, U.S. Venture Partners,
Plug & Play Ventures, Osage University Partners.


Data Analytics (5/19)


Company Details Funding Investors
AtScale [San Mateo, 2013]: AtScale helps create Dynamic Cubes to reduce the complexity of traditional approaches
by eliminating the need for data movement or tool-specific data structures. These cubes, or data models, allow
business analysts to be more effective. Existing BI tools can connect to Hadoop through AtScale's virtual cube
using JDBC, ODBC, etc. to extract the required information. It has a hybrid query service and an adaptive cache along
with a design dashboard to create the required data models.
Funding: $20M. Investors: Comcast Ventures, Ameventures, XSeed Capital, UMC Capital, AME Cloud Ventures,
Storm Ventures.

Prevedere Inc [Columbus, 2012]: Prevedere provides Big Data predictive analytics to large enterprise customers,
private equity, and investment firms. Analysts and data scientists across finance, procurement, sales and marketing can
use the software to forecast sales and raw material prices for enterprise customers. Investment firms and hedge funds
use its software to perform industry and portfolio holding analysis to gain higher returns on investment. Customers
include Racetrak, Hamilton, Aecom and Momentive, among others, targeting software, networking and hardware in several
industries and specialties including mobile commerce, cloud computing, health care, education and Big Data.
Funding: $7.9M. Investors: Rev1 Ventures, PointGuard Ventures.
Crayon Data [Chennai, 2012]: Crayon Data provides a SaaS Big Data Analytics platform with a focus on the hospitality,
finance, retail and technology verticals. It offers two products: Simpler Choices and One Drop Analytics. Simpler
Choices is a choice engine that helps consumers and businesses make better-informed and smarter decisions, while One
Drop Analytics boosts B2B sales and marketing intelligence and demand generation by providing insights on companies and
markets. It has 10 customers including a mid-sized hotel chain in the UK, an MNC bank in Singapore and a lifestyle and
fashion retailer in India. The CEO claims a run rate of $500,000 and hopes to close 2015 at $6M in revenue. Backed by
Jungle Ventures and Spring Seed Capital along with some individual investors. HQ in Singapore with a dev center in
Chennai, India. Received undisclosed funding from Ratan Tata in Nov '15. Raised an undisclosed investment from
conglomerate Mitsui, through which Mitsui will support the launch of Crayon’s products in Japan and the expansion
of its customer assets.
Funding: $7.34M. Investors: Jungle Ventures.

OThot [Pittsburgh, 2014]: OThot offers a SaaS-based predictive analytics platform and intelligence-driven solutions for
easy integration of predictive analytics into business processes. OThot captures, processes and analyzes large data
sets, providing its customers with understandable results. Its first product, The Calisto Decision Predictor, answers
questions specific to an industry by combining consumer profiles with Calisto predictive scoring engines.
Funding: $2.4M. Investors: Opus Global.


Data Analytics (6/19)


Company Details Funding Investors

DecisionNext [San Francisco, 2013]: DecisionNext delivers big data predictive analytics software in the cloud. Large
companies use DecisionNext to support high-value decisions ranging all the way from purchasing to sales. It supports
specific functions including commodity price forecasting, purchasing optimization, product mix optimization, capacity
optimization, and pricing and promotion optimization, connecting these optimizations into a single system.
Funding: $2M.

BigML [Corvallis, 2011]: BigML offers a scalable, cloud-based machine learning service. Through a simple interface,
users can quickly analyze their data and build predictive models without any prior expertise. The user can explore these
models for new insights and use them to make predictions. It can be used to analyze and predict customer behavior,
increase customer loyalty, increase site conversion, support diagnostics in healthcare and build risk profiles, among
other applications.
Funding: $1.4M.

DMWay [Tel Aviv, 2013]: DMWay provides predictive analytics, which uses past data, current data, demographic
information and other relevant data to predict future events. Prediction results are often given in probabilistic terms,
for example the probability that a customer will respond to a new product offering, the expected donation amount to
a charity, and so on. Predictive analytics encompasses a variety of techniques from statistics, machine learning, AI,
data mining, optimization and others to analyze the relevant data in order to predict future events. Business
applications of predictive analytics include targeting churn, customer retention, fraud detection, risk analysis and
customer lifetime value.
Funding: $1M. Investors: Jerusalem Venture Partners.
Turing Data, LLC [ , 2011]: Turing Data provides human behavioral big data predictive analytics. Turing Data also
develops its own software, varying from mobile apps to a SaaS platform for market research polling, banking and
insurance processing. For the SaaS platform, data is either provided directly by Turing Data’s suite of applications or
can be imported in a number of formats. It is a multi-national entity whose corporate governance and global function
executives are based in Israel, with algorithm and software development in Greece and the United States and sales and
marketing planned in the USA and Europe.
Funding: $1M. Investors: undisclosed.

Angoss [Toronto, 1984]: Angoss delivers predictive analytics and data mining solutions for growing revenue,
increasing sales productivity and improving marketing effectiveness while reducing risk and cost. It provides
comprehensive modeling, patented Decision Trees and strategy design to analysts, with the flexibility to code in the
language of SAS, SQL and R or simply use pre-built functional nodes for automated code generation.
Funding: $616k. Investors: undisclosed.


Data Analytics (7/19)


Company Details Funding Investors

Predictify [Raleigh, 2014]: PredictifyMe provides predictive analytics solutions by combining private data and
publicly available data such as economic surveys, expenditure surveys, county statistics, and other personal,
economic, social, political and futuristic indicators. By connecting all data sets, it forms a larger picture: an
exhaustive list of data points for a particular person, product, market, or industry.
Funding: $305.26k. Investors: undisclosed.

Zero Locus [Milwaukee, 2013]: Zero Locus creates predictive analytics software. Zero Locus Inc. develops software
systems to perform advanced analytics for various data sets. It serves customers in the insurance, health,
retail, and finance industries.
Funding: $20k. Investors: Gener8tor.

PredictedNow [Coventry, 2014]: PredictedNow is developing methods for continuous, automated analysis of online
data to measure and predict collective human behavior. Currently in stealth mode.

Zementis [San Diego, 2004]: Zementis, Inc. provides software solutions for predictive analytics. Core solutions include
ADAPA, a decision engine for predictive analytics, and UPPI, a plugin utility for analytics and data warehouse platforms.
Solutions can be deployed on-premise and in the cloud. Customers include financial institutions, marketing and
advertising agencies, consumer and enterprise technology service providers, telecom and government agencies.
ADAPA helps data science teams and IT departments collaborate on the development and deployment of predictive
models. Partnered with Ngdata to provide real-time predictive analytics. UPPI allows integrating predictive analytics
into other analytic workflows. UPPI can be integrated into the Hadoop ecosystem and also into other databases. It has
offices in San Diego, San Francisco, and Hong Kong.


Data Analytics (8/19)


Company Details Funding Investors
FICO [Minneapolis, 1956]: FICO provides predictive analytics and market segmentation tools to support business
strategies, helping decision makers predict and choose from a set of options that would drive business
performance. It provides the FICO Analytic Cloud, in which businesses can access advanced analytic tools in the cloud to
build, customize, configure and deploy solutions to improve their business decisions. It also provides the FICO Score,
a standard measure of consumer credit risk in the United States that helps people manage their personal credit health.
Other offerings span marketing and customer engagement, fraud and security, and bankcard scoring solutions.
FICO also offers a decision management suite, decision modelling and big data analytics, and is an end-to-end
business strategy solution. Clients include many of the top Fortune 500 companies including BMW, Dell, Walmart, GM,
and Chase.
ISIS Solution [ , 2004]: ISIS Solutions provides the ISIS Predictive Forecasting and ISIS EnterpriseZ Discovery &
Predictive Analytics software, which enables business users to harness the power of predictive analytics through a
simple English language interface, without programming, spreadsheets or IT support. It contains a broad range of
predictive analytics covering variance analysis, statistical best practices, leading indicators and correlations.
Predictive Forecasting provides visualization through analysis, trending, forecasting and prediction. EnterpriseZ
Predictive Analytics provides a user interface that business users can use to interact with applications
deployed on the EnterpriseZ server. Predictions, report types, charts and statistical analysis are included in EnterpriseZ.
It provides EnterpriseZ applications and EnterpriseZ market applications in different markets such as healthcare, finance,
transportation and manufacturing. Provides cloud and on-premise offerings.

Autonomy [Chicago, 1996]: Autonomy uses advanced pattern-matching analytics technology for analysing
documents, pictures, emails, videos, etc. to extract information from call detail records, gene sequencing, sensors,
algorithmic trading, click streams, and other sources. Autonomy software helps businesses and organizations with
meaning-based solutions that understand the full spectrum of enterprise information as well as the relationships that
exist within it.

AimLogic [Las Vegas, 2014]: AimLogic provides predictive lead scoring through its proprietary big data technology. It
helps in microtargeting leads within a particular geographical region and helps improve lead turnover
compared to organic scoring. Its proprietary algorithm creates predictive models of leads with the highest
propensity to convert in the targeted area. By using this technology, users can gain insights into customers' financial
profiles and retail behavior, combined with thousands of other essential consumer data points and 30 years of public
and private historical market activity.


Data Analytics (9/19)


Company Details Funding Investors
SPSS South Asia Pvt. Ltd. [New York City, 1968]: IBM SPSS Statistics is a statistical software used to solve business
and research problems by means of ad-hoc analysis, hypothesis testing, and predictive analytics. Organizations use
IBM SPSS Statistics to understand data, analyze trends, forecast and plan to validate assumptions and drive accurate
conclusions. Statistics included in the base software: Descriptive statistics: Cross tabulation, Frequencies, Descriptives,
Explore, Descriptive Ratio Statistics. Bivariate statistics: Means, t-test, ANOVA, Correlation (bivariate, partial, distances),
Nonparametric tests. Prediction for numerical outcomes: Linear regression. Prediction for identifying groups: Factor
analysis, cluster analysis (two-step, K-means, hierarchical), Discriminant

Predictvia [Caracas, 2014]: Predictvia's first product, Seenatra, performs automated, machine learning-based modeling
of a product's customers, allowing clients to predict the behavior of both current and potential customers.
With this Customer Model, Seenatra can then find potential customers for its clients among social network users.
Using this, clients decrease the cost of customer acquisition and increase customer lifetime value.
Through adaptive learning methods, this efficiency increases over time.

Actuate [San Mateo, 1993]: Actuate founded and sponsors BIRT, an open source reporting software
with a user-friendly Integrated Development Environment (IDE) for developers. It has developed a product range
around BIRT for BI solutions: BIRT Analytics, BIRT iHUB and Customer Communication Management. It is
capable of providing big data analytics. Currently it provides software to more than three million BIRT developers
and OEMs. Some customers include Bankdata, Blue Cross, Broadridge Financial Solutions, Inc., CA, Inc., CGI Group Inc.,
Elcom International, Inc., ING Life Insurance and Annuity Company, Lloyds Banking Group, MetLife Inc., MSCI Inc and
more.


Data Analytics (10/19)


Company Details Funding Investors
7Puentes [Banfield, 2007]: 7Puentes builds big data applications and products that exploit big data sources arising from
social media, mobile devices and web customer intelligence, using new big data platforms based on the ‘map-reduce’
parallel programming paradigm. Menta is its main big data product, a fully equipped personalization system
including personal recommendations, most similar items, cart abandonment and churn strategies, upsells and a metrics
dashboard. Ventura is its social media intelligence product, which gathers information from social media sites,
using either intrusive or non-intrusive means, from open and closed social networks. It gives organizations deeper and
richer insights into business patterns and trends, helping drive operational efficiencies and competitive advantage.
7Puentes aims to solve business problems and provide new business opportunities. This simple shift can
transform the perspective, changing big data from a technological problem to a business solution. 7Puentes partners
with Cloudera, the largest ecosystem in the Hadoop market, to deliver an affordable, scalable and fully supported big
data infrastructure without the risks of a custom-built solution.

Insight Jedi [Bangalore, 2014]: Insight Jedi automates the workflow of making a decision with data using its
statistical products, which are built around mathematical techniques and proprietary algorithms. It claims that the
platform automatically tests hundreds of business hypotheses and recommends actions to achieve goals specified by
the clients.

Rapid Insight Inc. [ , 2002]: Rapid Insight provides enterprise predictive analytics and data intelligence software for data
science, higher ed, fundraising and healthcare. Users can easily build predictive models and integrate, aggregate,
cleanse and transform data into decisions with no programming or SQL skills required.

KNIME [Zurich, 2006]: KNIME is an open analytics platform and statistical modelling tool used for predictive
forecasting of data. The modular data exploration platform, initially developed at the University of Konstanz, Germany,
enables the user to visually create data flows, execute selected analysis steps, and later investigate the results
through interactive views on data and models.


Data Analytics (11/19)


Company Details Funding Investors

Abivin [Hanoi, 2015]: Based in Vietnam, Abivin is a big data analytics company currently focusing on its three products,
offered as APIs: VDocs, to analyse and report document form; VRoute, a machine-intelligence method to solve order
routing; and VCore, its predictive analytics API for banking and e-commerce.

Formcept [Bangalore, 2011]: Formcept is a big data analysis platform that provides batch processing, interactive
analysis and stream processing capabilities to enterprises. Formcept empowers the existing data analysts and data
scientists of an organization to extract insights out of data faster, thereby significantly reducing the time taken to
convert data into decisions. It also offers competitive analysis of companies in a particular domain. Its products
are available both as Software as a Service (SaaS) and as installable products. Enterprises can also write customized
applications (called “Intents”) on top of the platform according to their business needs or targeting their end
customers.

kyvosinsights [Los Gatos, 2000]: Kyvos Insights enables big data analytics with “OLAP on Hadoop” technology. It
allows users to build complex workflows for data in Hadoop with simple drag and drop, without coding. The cubes created
for analysis of multi-dimensional data can be connected to visualization tools to create interactive dashboards with charts,
graphs, etc.

Precog [Boulder, 2010]: Precog is a data analysis platform that helps companies to develop advanced analytics
applications on unstructured data. The platform accepts data from multiple sources such as Hadoop, databases, APIs
and can query data created in multiple formats: JSON, logfile, XML, etc. It lets users enrich the data and analyze it
using either a REST API or Precog's development environment: Labcoat. Labcoat is a visual query builder based on an
open source programming language called Quirrel and supports functions such as sentiment analysis, predictive
modeling and machine learning. Precog was acquired by RichRelevance in August 2013.
Funding: $2.77M. Investors: RTP Ventures, Resonant Venture Partners, Techstars, LaunchCapital, VegasTechFund.


Data Analytics (12/19)


Company Details Funding Investors
DataSift [Reading, 2010]: DataSift enables companies to aggregate, filter and extract insights from the billions of
public social conversations on Twitter, leading social networks and millions of other sources. DataSift provides access
to both real-time and historical social data to uncover insights and trends that relate to brands, businesses, financial
markets, news and public opinion. Delivered as a cloud platform, DataSift clients include companies with
requirements for social media monitoring, social CRM, business intelligence, financial trading and news monitoring
applications. DataSift is a certified Twitter data reseller partner.
Funding: $64.2M. Investors: Northgate, Daher Capital, Scale Venture Partners, Cendana Capital, A Ventures, Insight Partners, Upfront.

Germin8 [Mumbai, 2007]: Germin8 offers a cloud-based platform for enterprises which is focused on building and
providing stakeholder analytics tools and services that help its customers make better decisions. Its flagship product,
Explic8, is used by companies to understand what their stakeholders are saying about them, their products and
services and their competitors in public (social media, news sites) and in private (emails, surveys, chats, calls). Brands
and agencies use Explic8 for brand monitoring, lead generation, online reputation management, influencer
management.
Funding: $5.4M. Investors: InnoVen Capital, Kalaari.

PropheSee [Delhi, 2014]: Provider of a cloud-based solution for brands to discover and analyze data about their
digital presence and develop actionable insights to optimize performance. Provides data tools to track a brand's and its
competitors' image across social media and industry-specific channels, with Google Analytics integration. Provides a
dashboard to track all the relevant channels and offers automated or custom reporting. Raised $500k in angel funding
from Indian Angel Network and Stanford Angels and Entrepreneurs India in Dec 2015.
Funding: $500k. Investors: Indian Angel Network, Stanford Angels & Entrepreneurs.

ListenLogic [San Jose, 2007]: ListenLogic is a provider of advanced social business intelligence and social threat
detection to the leading brands across the media, entertainment, food and beverage, consumer packaged goods,
retail, pharmaceutical and technology sectors. ListenLogic’s advanced social intelligence solutions use “big data”
processing at 1+ billion streaming classification operations per second (SCOPS) to give corporations an understanding
of their markets, consumers and competitors. The result is precise, real-time, actionable insight from the open social
media universe to set strategy, guide decision-making, drive innovation and protect the business.

Schemalogic [Kirkland, 2001]: Smartlogic Semaphore is a content intelligence platform that unifies unstructured with
structured data under a consistent set of semantics, allowing the organization to discover new insights and develop
competitive strategies. The product is available as both an on-premise and a SaaS offering and can analyse text, mails,
conversations, etc. of both internal and external stakeholders.
Funding: $17.4M. Investors: Goldman Sachs, The Phoenix Partners, undisclosed, Phoenix Partners Group, MADRONA.


Data Analytics (13/19)


Company Details Funding Investors
Motista [San Francisco, 2007]: Motista delivers predictive intelligence on consumer emotion. Motista quantifies the
emotions most motivating consumers to buy, recommend and pay more for specific brands. Motista measures
emotion with unprecedented accuracy, ties emotion to business outcomes, and delivers actionable intelligence into
“why” consumers buy. Its clients are Fortune 1000 market leaders in the financial services, CPG, retail, healthcare,
hospitality, Internet and B2B markets.
Funding: $10.3M. Investors: El Dorado Ventures.

ClearForest [Boston, 1998]: ClearForest provides a SaaS platform for text analytics and text mining solutions. It offers
several hosted solutions, including: OpenCalais, a free web service and open API (for commercial and non-commercial
use) that enables automatic metadata generation using the company's financial module; and Semantic Web Services
(SWS), an on-demand service that makes its natural language processing tools available as a standard web service, in
order to allow the development community to explore the value of building innovative applications and services that
leverage semantic processing. Gnosis is its free Firefox extension that uses SWS to analyze the content of a web page.
With a single click, Gnosis identifies the people, companies, organizations, geographies and products on the page
users are viewing. It also automatically processes pages from Wikipedia itself, providing additional links for people,
geographies and other entities which were not explicitly linked within the subject article.
Funding: $10M. Investors: Greylock, Walden Venture Capital.

Lymbix [New Brunswick, 2009]: Lymbix is a text analytics solution that measures the tone and emotional impact of
words in everyday written language. The Lymbix technology delivers highly precise sentiment analysis and
determines how words and phrases expressed in blogs, posts, email, or other social media make people feel (e.g.
Angry). Developers and partners in the Social CRM, Customer Support, Text Communications, and eDiscovery fields
(among others) can add value to their products and services by leveraging the Lymbix API for a deeper understanding
of the sentiment in text.
Funding: $1.25M. Investors: GrowthWorks.

ScanandTarget.com [Paris, 2007]: Scan & Target provides a SaaS solution for the real-time analysis of text user
generated content (UGC). The solution is aimed at publishers of community and social networks, moderation service
providers, providers of technical solutions for web (forums, blogs, chat, social buying, etc.) and professional services
i.e. CRM and contact centers. It also provides digital conversations analysis for government agencies, service
publishers, marketing agencies, e-commerce sites, and media. The company’s text meaning technology understands
the meaning behind text, offering customers the relevant information they need to moderate, monetize, or gather
intelligence from digital communications, including SMS, e-mails, forums, chat, blogs, tweets, and Facebook
comments. It offers moderation, social CRM, audience segmentation, and sentiment analysis solutions.
Funding: $1.1M. Investors: Scientipole Capital, Crédit Agricole Private Equity.


Data Analytics (14/19)


Company Details Funding Investors
ThriveMetrics [New York City, 2011]: ThriveMetrics is enterprise software for mining corporate electronic
communications for topics, volume trends and sentiment, allowing companies to better monitor and understand
engagement with their marketplace and within their employee base. Its enterprise data analytics technology uses
proprietary algorithms on real-time digital communications such as email, IM, enterprise social, etc. to monitor how
companies engage with their customers and how management and employees engage with each other, mining
topics, trends, sentiment, frequency and volume from those real communications. The company claims to have
Fortune 100 Financial Services and Insurance firms as clients. It includes customizable dashboard reporting for Sales
Enablement, Business Process Re-engineering, Management of Initiatives (such as those related to Diversity &
Inclusion) and Compliance.
Funding: $600k. Investors: undisclosed.

Textual Analytics Solutions [Bangalore, 2004]: Textual Analytics is a SaaS firm that provides infrastructure for
information processing and analysis. It offers two products: document to data stream conversion
(conversion of unstructured documents to structured data streams) and Integration of Disparate Data-streams
(integration and modeling of unstructured data from data streams into a unified grid output, which is then modeled with a
consistent data structure). It helps bring different information into a single pane.
Funding: $250k. Investors: Mumbai Angels.

SetuServ [Chicago, 2012]: SetuServ's mission is to help clients derive actionable intelligence from their raw text data
by using technology solutions that combine experts, crowdsourcing and machine learning algorithms. The company
claims that enterprises today have access to various sources of unstructured data, such as social media data and user
feedback, that contain rich insights, but machines can only analyze it partially as the data is not structured. It
has developed a proprietary model called Skierarchy that allows it to handle complex tasks while maintaining
high quality and the privacy of clients' data, using machine learning techniques to filter for the information that
could be of interest and to assist the crowd with productivity and quality.
Funding: $160k.

OdinText [New York City, 2015]: Provides analytics solutions for both structured and unstructured (text) data. It analyses
various sources of data for social media monitoring (product reviews, Twitter, Facebook, etc.), customer experience
management (call logs, emails, satisfaction data) and market research. Customers include the likes of Shell, Coca-Cola and
NBC.


Data Analytics (15/19)


Company Details Funding Investors
Redlink GmbH [Rome, 2013]: RedLink provides text analytics solutions for enterprises, including a highly
customizable cloud-based multi-lingual content enrichment and semantic search PaaS along with out-of-the-box
plugins for existing content management systems. The solution can be used by Content Managers and Developers for
semantic enrichment and search. The platform is built on open source technologies such as Apache Stanbol, Apache
Solr and Apache Marmotta.

Inspica [Bangkok, 2012]: Inspica provides natural language processing software for text analysis, text mining and
enterprise search. It provides Thai Text Analytic Software, a program that solves NLP problems such as Thai Word
Tokenization, Thai Romanization, Sentiment Analysis and Topic Categorization. Inspica also offers a search plugin for
Apache Solr that makes Apache Solr understand Thai.

EvoApp [Raleigh, 2009]: EvoApp analyses the patterns in relevant conversations that impact business decisions. Its
real-time data mining and analysis platform gives big data meaning by correlating it with metrics that drive businesses.
Use cases are generating leads, improving customer service, targeting audiences and accelerating product launches. The
company is now deadpooled.
Funding: 0.

Retechnica [London, 2012]: Retechnica provides two products, Ingenia API and Compass Insight. Ingenia API is the
text analytics engine: it enables users to make sense of their content and define how to categorise it. It comes
with an advanced and flexible recommendation engine and a summarisation engine. Clients use it to introduce
smart navigation, personalisation or analytics features in their products. Compass Insight is the information
aggregator for enterprise: it is for teams that need to make sense of large amounts of information to inform business
decisions. It enables users to monitor competition, products and topics, identify trends, inspire innovation and generate
actionable insight. It aggregates information in one intuitive interface. It has a text analytics API and a web app that tags
users' content automatically. It is unique in that it uses tags bespoke to the content and keeps content organised. It can be
used as a collaboration tool to organise content; for an e-commerce company, it can extract actionable insight from
reviews of its products; for a news app, it can summarise the key points of an article, support SEO, etc.


Data Analytics (16/19)


Company Details Funding Investors
Kaypok [Markham, 2012]: Kaypok provides content analytics tools to uncover insights in unstructured text. Users
discover what people are saying, can explore their data and get actionable insights. Kaypok’s high performance
algorithms automatically process noisy, unstructured information and extract usable knowledge and insights about
what people are saying, sentiments and the root information elements which drive analytics. Unlike other
technologies, the algorithms need no training or dictionaries, making them scalable and applicable to many different
application areas. The technology analyzes data regardless of source and includes public social media, enterprise
textual data, surveys, emails and blogs.

Trending [San Francisco, 2013]: Trending provides a text analytics solution to discover, track, and forecast signals for better
investment and business decisions. It uses a data mining and machine learning platform that captures real-time
information from the unstructured text and data of technology news, commentary, and social media. Statistical
natural language processing lets users recognize, validate, and disambiguate the most important topics to deliver
specific and actionable signals to the customers. With interactive visualizations, charts, relative sentiment, direct links
to news, and clustering, users such as investors, company managers, and finance professionals can leverage an
informational advantage.

Quiverity [San Francisco, 2013]: Quiverity develops technology that identifies and ranks the most relevant qualitative
information across the web, curated specifically for finance professionals. Quiverity enables clients to interpret
massive amounts of information to support internal research and validate their investment thesis. Sourcing content
from online media, blogs, social networks, video, transcripts, press releases, and filings, it aggregates and analyzes
information on companies, products, technologies, people, places, and other related topics. It provides data and
research through its web and mobile apps, as well as via historical time series datafeeds for more rigorous analytics.

MeshLabs [Bangalore, 2009]: MeshLabs is a provider of advanced text analytics and social engagement solutions to
solve information management, customer experience, BI, and regulatory compliance problems of businesses.
MeshLabs's text analytics engine is a hybrid mix of linguistic, statistical, and semantic approaches. MeshLabs provides its
services in social listening, text analytics, and natural language processing. MeshLabs was acquired by
Pegasystems in May 2014 for an undisclosed amount.


Data Analytics (17/19)


Company Details Funding Investors

Zyboorg [Kochi, 2014]: Provides tech-based AI solutions for enterprises. Also offers Text Analysis and Natural
Language Processing tasks including sentiment analysis, Q&A and summarization. Offers applications and services on
various cloud-based platforms. Its product PreLaunch validates mobile apps by providing predictive analytics on
ratings, downloads, etc., helping developers and startups. Incubated at Startup Village.

Surukam Analytics [Chennai, 2014]: Surukam is a technology-driven startup for text analytics and natural language
processing for legal, finance, education, publishing and HR domains. They help in automation of decision-making
processes and workflows. Surukam uses next generation Artificial Intelligence solutions.

Stride [Bangalore, 2013]: Offers a platform for enterprises to generate insights from unstructured data from various
sources. Modules include sentiment analysis, subjective sentiment analysis, topic modelling and spell correction. Is
offered both as an API and as a SaaS-based offering with a dashboard visualization that enables companies to make sense of
text.

Zyphion [Bangalore, 2014]: Developing a cloud based platform for businesses to transform social noise into valuable
insights. Also developing NLP based analytics solutions and call-to-action engagement features. Enables data
collection from online and offline transactions, mobile devices, audio and video or any other type of sensor and
available third party data along with building analytical models to help understand data. Product of Enmerchant
Business Solutions Lab and is still under development.

TCorpus [Bangalore, 2013]: Developed two products, FirmCONNECT and EmailCONNECT, which allow users to extract
data from various sources, integrate it and present it alongside business function categories. FirmCONNECT is a product
for text analytics in Capital Markets for analysts and investors. EmailCONNECT is a cross-industry text analytics product
for e-mails with facilities for integrating big data sources and analytics. Provides solutions to customer-specific
problems with unstructured and Big Data using components of Natural Language Processing, Machine Learning and a Big
Data platform. Currently caters to the Financial services industry. Operates in Bangalore and incubated at VIT - TBI.


Data Analytics (18/19)


Company Details Funding Investors

Open Data [Lahore, 2015]: Open Data Systems is an open data research and big data management company. Claims
to be developing a Hadoop-based statistical data and predictive analytics platform. Provides both IaaS and SaaS interfaces.

Treparel [Utrecht, 2007]: Treparel is a provider of text analytics and visualization technology. Text analytics can be
used by corporates and the government sector for analyzing enterprise content, websites, email, patents or research
literature across industries like Patent Services, Fraud Detection, eDiscovery and Forensics. It provides them with
modular clustering, machine learning based classification and visualization technology; partners can benefit from
years of development, research and client experiences.

Coginov [Brossard, 2002]: Coginov provides Natural Language Processing (NLP) or semantic analysis technology that
makes it possible to automatically understand content, structure knowledge and outline accurate and valuable data.
Coginov has developed the CoginovAPI, a developer-friendly and licensable semantic analysis technology that can be
integrated into any solution to facilitate implementation and ongoing document information processing. The
CoginovAPI can assist with mass extraction, classification and summarization, and provide sentiment analysis over any
content processed by its solution. The company’s uniqueness comes from its advanced language rule-based
algorithms. Also in its product portfolio is the CoginovAP, a semantic analysis engine that can be licensed by software
vendors and large organizations that need advanced text analysis for email management, enterprise content
management, and social media monitoring. Coginov's customer base comprises organizations in the government,
public services and the private sector, including BNP Paribas, Cirque du Soleil, Gildan, Hôpital Charles LeMoyne,
Hydro-Québec, L’Oreal, Ottawa Center for Research and Innovation, SAAQ, etc.

Receptiviti [Toronto, 2014]: Receptiviti’s core IP, the Linguistic Inquiry and Word Count, is a language-psychology
based text analysis tool. It is licensed and used by organizations like the NSF, NIH, US Army and some of the world’s
technology companies. Pricing starts at $250 for 2,000 users (for startups) and $2,500 for 40,000 users (for
businesses).


Data Analytics (19/19)


Company Details Funding Investors
Datoin [Bangalore, 2014]: Datoin applies big data text analytics to Customer Experience Management (CEM), which
generates large volumes of text data from customer interactions through e-mails, chat, transcribed telecalls, reviews and
social media feedback. It analyzes this data to improve customer satisfaction. Developer of web software that helps
businesses build applications from off-the-shelf components by assembling them as a pipeline of components. The
applications can be run as APIs. Has developed an application for crawling and extraction that extracts structured
data from crawled web pages. In the process of developing Machine Learning APIs and other applications.

MeaningCloud [New York City, 2015]: MeaningCloud offers a SaaS-based semantic API. Its application-specific web
services are optimized for various industries and scenarios (e.g. social media analysis, semantic publishing).
MeaningCloud consists of a set of web services that extract elements of meaning (topics, facts, opinions,
relationships) from all kinds of unstructured multimedia content. These capabilities are packaged, published
and provided in such a way as to be meaningful to specific businesses and application scenarios. Pricing starts at
$99/month and goes up to $999/month based on the number of features.

Squirro [Zurich, 2012]: Squirro brings internal and external customer data together, delivering a 360° near real-time
picture of client engagement and market trends. It collects and analyses data across multiple sources such as Service
Desks, Configuration Management Databases, Internal Chat and external feeds to give a real-time picture of
customers.
Funding: $1.5M. Investors: FormulaVC.


Data Discovery (1/4)


Company Details Funding Investors
Platfora [San Mateo, 2011]: Platfora is an end-to-end software platform that runs natively on Hadoop and Spark to
provide raw data preparation, in-memory acceleration, and rich visualizations to better share insights. An interactive
and visual full-stack platform delivered as subscription software in the cloud or on-premises, Platfora Big Data
Analytics is creating data-driven competitive advantages in the areas of security, marketing, finance, operations and
the Internet of Things. Leading organizations such as Citi, Comcast, DirecTV, Disney, Edmunds.com, Opower, Riot
Games, Vivint and The Washington Post use Platfora.
Funding: $95M. Investors: Sutter Hill Ventures, Cisco Investments, Citi Ventures, Andreessen Horowitz, In-Q-Tel, Battery, Tenaya Capital, Allegis Capital, Harmony Partners.

Datameer [San Mateo, 2009]: Datameer provides big data analytics solutions to discover insights from data via
wizard-based data integration, iterative analytics, and visualizations. Datameer uses Hadoop for both storage and
compute, and can integrate with existing data warehouse or business intelligence solutions. Its patent pending
technology, Smart Execution, uses in-memory technology to optimize the analytics pipelines at runtime based on
data, resources and cost. Some of its customers include Visa, Citi, Bank of America and more.
Funding: $76.8M. Investors: Kleiner Perkins, Next World Capital, Citi Ventures, Redpoint, Software AG, Workday, Top Tier Capital Partners, sttelemedia.com.

Logi Analytics [McLean, 2000]: Logi Analytics (formerly LogiXML) enables organizations to create web-based BI and
analytic applications that can be integrated directly within the applications, systems, and processes that support their
business. Its technology allows organizations to rapidly develop, deploy, and adapt applications to serve business
users without extensive development or professional services. It is headquartered in McLean, Virginia, with sales and
support offices in the UK serving Europe. It currently serves 1,600 customers worldwide.
Funding: $48M. Investors: Grotech, Summit Partners, Updata, LLR Partners.

Zoomdata [Reston, 2012]: Zoomdata provides data analysis and visualization solutions. It has an array of data
connectors capable of sourcing data from different nodes, including big data sources like Impala, HDFS and MongoDB as
well as social media and proprietary databases. It offers a set of pre-built visualizations, such as scatter plots, and a
custom dashboard creation feature. Designed to support Big Data, Zoomdata's Stream Processing technology delivers
real-time data feeds to tablet and browser-based devices. Through the use of touch screen devices, users are able to
interact with data in real time, rewind the data, compare the data and share views with their colleagues. As of 2014 it
had 20 paying customers.
Funding: $47.2M. Investors: Accel Partners, New Enterprise Associates, Comcast Ventures, Columbus Nova Technology
Partners, Razor, CIT, 7 Inc, Goldman Sachs.

1010data [New York City, 2000]: 1010data provides a cloud-based platform for big data discovery and data sharing.
The platform offers advanced analytics such as statistical modelling, including regression analysis.
Other features include data integration and visualisation.
Funding: $35M. Investors: Norwest Venture Partners.

Data Discovery (2/4)


Company Details Funding Investors
ClearStory Data [Palo Alto, 2011]: ClearStory offers a business intelligence platform with rich visualization and
sharing capabilities and a recommendation engine to correlate different datasets. It has a platform for integrating a
company’s internal and external data using an in-memory database leveraging Apache Spark, an open-source
cluster computing system for in-memory processing. Its customers include Hershey's, Coca-Cola, Colgate, Merck and
Amazon Web Services.
Funding: $30M. Investors: Google Ventures, Kleiner Perkins, Khosla Ventures, Andreessen Horowitz, DAG Ventures.

Arcadia Data [San Mateo, 2012]: Arcadia Data builds a unified visual analytics and BI platform for big data. The
Arcadia Converged Analytics Platform unifies visual exploration and back-end data analytics in one integrated
enterprise platform that runs natively on a Hadoop cluster. It converges the visual, analytics and data layers to provide
accelerated access to all of the data stored within Hadoop, and supports net-new analytics on granular datasets. It
claims to have customers within the Fortune 200, with two customers using its platform to analyze more than 100B
rows stored in Hadoop.
Funding: $12.68M. Investors: Mayfield, Blumberg Capital, Intel Capital.

Smart Insight [Tokyo, 2013]: Mugen from Smart Insight is a big data discovery and analytics platform that supports
autonomous relationship discovery across enterprise systems for data model preparation. Smart Insight has developed a
software platform that enables enterprises to analyze both structured and unstructured data, visualize correlations,
and gain business value and insights. Businesses have complex enterprise systems that hold important information about
key business entities like customers, products, and locations. Mugen connects to various enterprise systems to
autonomously develop an Enterprise Data Graph, giving insights into the field entities that are related across all
systems. It connects to a wide variety of both on-premise and cloud systems including HDFS, RDBMS, NoSQL and Solr.
Funding: $4M. Investors: INCJ.

SynerScope [Eindhoven, 2011]: SynerScope software provides an interactive visual analytics suite that allows business
users to explore and analyze data in an intuitive way. Visualizing relationships between enormous numbers of data
entities permits quick visual identification of anomalous patterns to detect and analyze deviant processes in white
collar and IT work, as well as cybersecurity and fraud. The full data inventory is also shown visually and can be
processed by a fully automated, integrated workflow engine. Customers need only modest training, so Total Cost of
Ownership (TCO) is low.
Funding: $3.3M.

Data Discovery (3/4)


Company Details Funding Investors

DataPad [San Francisco, 2013]: DataPad provides data preparation and visual analytics solutions. It offers visual
analysis tools to SMEs with features like custom dashboards and collaboration. Companies like Datahero and Chart.io
are its competitors. It was acquired by Cloudera in 2014.
Funding: $1.7M. Investors: Accel Partners, Google Ventures, SV Angel, Andreessen Horowitz, Ludlow Ventures.

SIFT Business Intelligence [Portland, 2015]: SIFT is a cloud-based data discovery BI platform that fills the gap between
dashboards and enterprise-level reporting. It employs Smart Templates that enable users to ask any question of the
data, drill down into details of interest, save favorite views, share insights with others, and generate scheduled
reports that are delivered via e-mail. Customers include ConocoPhillips, NCI, Suncore, Braun and others.

InetSoft Marketing [Piscataway, 1996]: InetSoft provides Java- and web-based business intelligence software for
reporting, analytics, dashboards, and visualization, combining disparate data sources in real time. Its patent-pending
Data Block™ technology enables reuse of queries and a capability for end-user defined data mashup. It supports big
data sources like Hadoop. InetSoft solutions have been deployed at over 3,000 organizations worldwide, including
Fortune 500 companies, spanning all types of industries.

iccube [Ecublens, 2009]: icCube's Business Intelligence suite is a real-time analytical and visualization engine. A
high-scalability OLAP server and flexible BI web reporting are offered in the free Community Edition, while additional
features for dashboard and visualization tools are provided in the Corporate Edition for complex deployments.

Ideata Analytics [Indore, 2013]: Ideata analytics is a big data intelligence platform that provides an end-to-end,
analytical application for business users. The platform helps users perform information discovery on their big data
including interactive reporting, dashboarding and analytics, while working with data sources of their choice. Along
with the platform, it offers several pre-packaged analytical apps and tools to build custom apps required for business.

Data Discovery (4/4)


Company Details Funding Investors

Scalend [Bangalore, 2014]: Scalend is developing solutions to provide big data insights for businesses. It is currently in
private beta and was part of the Startup Next Bangalore accelerator in Fall 2014.

Tuplejump [Hyderabad, 2013]: Tuplejump provides a platform that offers the infrastructure components, tools and
blueprints to build big data powered applications. The infrastructure services enable users to collect, store, analyze
and visualize data. The platform is used by organizations working in the fields of Internet of Things and digital
advertising to gain insights from the vast amounts of data being collected.

Bizalyticks [Fullerton, 2013]: Bizalyticks offers an analytics platform called the Customer Vector Analytics platform,
which allows users to access, manage and analyze operational data for business intelligence. The platform automates
analytics and automatically creates analytical models that make bespoke predictions and recommendations.
The Bizalyticks OneCloud solution can connect to many different Hadoop clusters: Cloudera Distribution including Apache
Hadoop (CDH), the Hortonworks Data Platform (HDP), Apache Hadoop with Hive, Amazon Elastic MapReduce, and
MapR Hadoop, allowing analytics on Big Data.

Data Preparation (1/9)


Company Details Funding Investors
Trifacta [San Francisco, 2012]: Trifacta provides a platform for discovering, cleaning and shaping Big Data. Trifacta's
software automates the process of transforming data from sources like Hadoop into something that can be
easily digested by software visualization and business intelligence tools. The technology is based on research at
Berkeley and Stanford that focuses on interactions between humans and machines to transform data into something
useful and meaningful. Pricing is based on the volume of data a company processes using the Trifacta software, and
typically starts at roughly $100,000 to $150,000. LinkedIn, Autodesk and EMC Corporation are some of its customers.
Funding: $76.3M. Investors: Ignition Partners, Accel Partners, Greylock, Infosys, XSeed Capital, Cathay, Data Collective.

Tamr [Cambridge, 2012]: Tamr makes it easy to curate data from a large number of sources using machine learning
techniques, reducing the man-hours spent to achieve similar goals. It provides an automated workflow to connect and
create models of data source semantics while engaging data experts to re-tune them if necessary. It works with a
wide range of data source repositories, including HDFS, Google Cloud Storage, Amazon Redshift, MongoDB and Cassandra.
Tamr can be deployed either as on-premise software or as a cloud-based solution.
Funding: $41.2M. Investors: New Enterprise Associates, SineWave Ventures, MassMutual Ventures, Hewlett Packard
Ventures, Thomson Reuters, Google Ventures.

Paxata [Redwood City, 2012]: Paxata provides an Adaptive Data Preparation platform that enables business analysts to
turn raw data into data that is ready for analytics. It automates data integration, eliminating the need to write code,
exposes semantic data quality through a visually interactive graphical interface, and enables contextual enrichment,
ad hoc collaboration and transparent governance. Paxata’s connection with BI tools like Tableau, QlikView and Excel
gives business people the flexibility to use the visualization and discovery solutions they prefer.
Funding: $26.02M. Investors: Accel Partners, EDB Investments, Walden International, Toba Capital.

Waterline Data [Mountain View, 2013]: Waterline Data is an automated data discovery platform. It first catalogs all
the data in Hadoop automatically. It then enables users to find, understand, and provision the data for use in data
prep or analytics tools. It is suited to cases where data in Hadoop spans millions of columns and manually tagging and
prepping the data is tedious. It also helps in maintaining data governance.
Funding: $23M. Investors: Infosys, Menlo Ventures, Partech Ventures, Jackson Square Ventures.

Datactics [Belfast, 1999]: Datactics provides data consolidation and re-engineering software, including edit distance
(fuzzy) matching, equipping the business user to access information held in multiple formats and languages, through
massively parallel processing (MPP) and in-memory technology. Datactics software populates missing fields and
realigns the data by parsing unstructured data and using adaptive master reference files, capturing the knowledge
held by the data experts in your organization.
Funding: $1.28M. Investors: Thule Investments, Clarendon Fund Managers.
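
As an illustration of the edit distance (fuzzy) matching mentioned in the Datactics entry, the following is a minimal
Python sketch of the general technique (Levenshtein distance). It is not Datactics' implementation; the function names
`edit_distance` and `fuzzy_match` and the `max_distance` threshold are hypothetical and chosen only for this example.

```python
from typing import List


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of single-character
    insertions, deletions and substitutions turning a into b."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute (or keep) the character
            ))
        prev = curr
    return prev[-1]


def fuzzy_match(query: str, candidates: List[str], max_distance: int = 2) -> List[str]:
    """Return candidates within max_distance edits of query
    (case-insensitive), closest first. The threshold is an assumption."""
    q = query.lower()
    scored = [(edit_distance(q, c.lower()), c) for c in candidates]
    return [c for d, c in sorted(scored) if d <= max_distance]


if __name__ == "__main__":
    print(edit_distance("kitten", "sitting"))  # 3
    print(fuzzy_match("Datactics", ["Datatics", "Datameer"]))
```

A record-matching tool would typically combine such a distance with normalization (case, punctuation, reference
files) before comparing fields; the sketch shows only the distance step.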

Data Preparation (2/9)


Company Details Funding Investors
Nube Technologies [Noida, 2010]: Nube Technologies helps businesses with Identity Resolution through
deduplication of repetitive data. Nube's Reifier combines machine learning with big data technologies to sift through
complex data, identify and remove duplicate data points. Reifier lets companies leverage existing internal data to
increase sales and operational efficiency by keeping the data clean and free from repetition. It also develops HiHo,
Hadoop data integration software that integrates with various databases, FTP servers and Salesforce to perform
incremental update, dedup, append and merge operations on data in Hadoop, and Crux, a reporting application for HBase.

Attivio [Newton, 2007]: Attivio crawls and classifies all data (structured, semi-structured, and
unstructured) to speed time-to-value for analytics. Attivio’s Active Intelligence Engine (AIE) connects all information
and delivers it in a single view, revealing the hidden relationships, insights, and intelligence that allow users to take
business decisions. It also provides enterprise search solutions, using the AIE for targeted search that returns
more relevant results. The company has offices in Massachusetts, Israel, Germany, and the UK.
Funding: $102M. Investors: tenave.com, Oak Investment Partners.

Alation, Inc. [California, 2012]: Alation simplifies enterprise unstructured data by centralizing knowledge into a single
place using machine learning and human analysts. Searches can be made using English keywords. Alation automatically
builds a searchable, comprehensive catalog of useful data documentation, covering all of the data sources. Users can
access relevant information (including experts, lineage, keys and indexes, relevant queries) and documentation on all
tables, across the organization’s data sources, in seconds. Alation helps users understand which tables and queries to
start with and see which tables are most often used, based on automatically calculated popularity ratings.
Funding: $9M. Investors: Costanoa, Andreessen Horowitz, Data Collective, General Catalyst.

Talend [Redwood City, 2006]: Talend is an open source software vendor that provides data integration, data
management, enterprise application integration and big data software and services. Talend Master Data Management
(MDM) unifies data from customers to products to suppliers and beyond into a single, actionable version of the truth.
Talend MDM combines real-time data, applications, and process integration with embedded data quality and
stewardship to share across on-premises, cloud and mobile applications. It has more than 4,000 paying customers.
Customers include eBay, Virgin Mobile, Sony Online Entertainment, Deutsche Post and Allianz. It has 350+ employees
in 14 offices in 7 countries and reports a 108% annual revenue growth rate.
Funding: $102M. Investors: Chausson Finance, Iris Capital, Balderton, Galileo Partners, Idinvest Partners, Silver Lake,
Bpifrance.

Data Preparation (3/9)


Company Details Funding Investors
SnapLogic [San Mateo, 2006]: SnapLogic is an integration platform that allows companies to connect applications
with each other both in the cloud and on premise. It offers solutions for data integration with its integration suite,
which includes SnapLogic Server, SnapLogic Designer and the SnapStore. The SnapLogic integration platform enables
communication between legacy and cloud applications. It is built on a RESTful architecture and connects applications
through a standardized SnapLogic engine. Projects can be built with full interface support from the native SnapLogic
library and SnapStore components.
Funding: $96.3M. Investors: Ignition Partners, H. Barton Asset Management, Microsoft, Andreessen Horowitz, Maples
Investments, Dhillon Capital, Silver Lake, Triangle Peak, Pharus Capital Management.

Pentaho [Orlando, 2004]: Pentaho offers Pentaho Business Analytics, a suite of open source
Business Intelligence (BI) products which provide data integration, reporting, dashboarding and data mining. A
combination of deep native connections and an adaptive big data layer ensures accelerated access to the leading
Hadoop distributions, NoSQL databases, and other big data stores for blending structured and unstructured data
coming from disparate sources. It allows data managers to profile data and ensure data quality with comprehensive
capabilities. It won the InfoWorld Bossie Award in 2008, 2009, 2010, 2011 and 2012 and has over 1,000 enterprise
customers such as ideeli, Kiva, Marketo, SpecSavers, and Swissport.
Funding: $48M. Investors: New Enterprise Associates, Benchmark, Index Ventures, DAG Ventures.

Treasure Data [Mountain View, 2011]: Treasure Data offers a cloud-based big data platform. Treasure Data's cloud
service and technology are specifically designed to provide an easier way to manage and analyze high-volume,
high-velocity, semi-structured data. The cloud service includes capabilities for data collection, storage and SQL
analysis. Treasure Data's corporate customers include MobFox, Getjar, GREE, Yahoo Japan and several Global Fortune
500 companies. It uses a monthly subscription-based monetization model.
Funding: $28.75M. Investors: Scale Venture Partners, Sierra Ventures, AME Cloud Ventures.

Alooma [Tel Aviv, 2013]: Alooma’s cloud-based service allows its clients to organize their data sources better,
pulling in information from multiple origins. The service relies heavily on Amazon Redshift, using it as the core of
its operations, into which users can upload all of their data. Alooma lets users draw from a set of 24
data sources including iOS, Azure, Google Analytics, MySQL, Salesforce, and others.
Funding: $15M. Investors: Lightspeed Venture Partners, Sequoia Capital.

UNIFi Software [San Mateo, 2014]: UNIFi Software is a data integration platform that runs natively on Hadoop.
Designed for business people, UNIFi delivers self-service data integration to the business users who analyze the data
for insights. With direct access to more data, business users can pursue “what if” scenarios without relying on IT,
dramatically reducing time to insight while freeing up IT resources. UNIFi runs on existing infrastructure, making
processing fast at a lower cost.
Funding: $14.45M. Investors: Canaan, Omaha Capital, Pelion Venture Partners.

Data Preparation (4/9)


Company Details Funding Investors
Capsenta [Austin, 2012]: Capsenta, incubated out of the University of Texas at Austin, provides Ultrawrap, an
enterprise data platform that uses advanced graph representation and semantic technologies to virtually integrate
diverse data sources, regardless of database platform, without the need for data centralization.
Ultrawrap allows integration of data sources by creating a virtual analytics layer. Gartner recognized Capsenta as a
"cool vendor" for Data Integration and Data Quality in 2016. It raised about $7.5M in seed stage financing.

Serendio [Santa Clara, 2010]: Serendio is a Big Data science platform that prepares data through the successive stages
of Ingest, Process, Secure, Persist and Analyze. The company calls this data ops as a service. Along with its SaaS
platform, it also provides managed services around harnessing the data. The platform is designed to address a broad
range of analytic workloads, both streaming (Storm) and batch (Cassandra, MongoDB, HDFS), structured and unstructured
(such as enterprise, social and sensor data), all through reusable design patterns, templates, and domain-specific
workflows. Popular platform configurations include Data Lake, Data Mart, and Data Stream. It has customers across
industries such as Healthcare, Retail and Insurance, among many others.
Xurmo Technologies [Bangalore, 2014]: Xurmo is an analytics company that has built a platform for unified analytics
on big data to host self-learning, predictive applications. Xurmo's flagship platform, TURF Ai, is an advanced platform
for self-service of big data analytics. Xurmo enables self-learning, predictive applications to be built by business
analysts and software developers with minimal support from data scientists, using its patent-pending Wormhole
methodology. Xurmo positions itself as a low-cost, less time consuming self-service platform for enterprises with big
data initiatives, consumer web companies, and analytics service providers. Xurmo works on HDFS for distributed
storage and Spark for distributed in-memory processing.

Astronomer [Cincinnati, 2015]: Astronomer is an open-source data integration platform that helps data engineers
collaboratively contribute code which data scientists and engineers can use for integrating web applications.
It is a hosted data pipeline service that can be used to connect various data sources like Oracle, MySQL and Salesforce
to destinations like Amazon Redshift, BI tools, Google Analytics and website analytics tools like Kissmetrics.
The company also provides implementation services for companies with no data engineers.

Data Preparation (5/9)


Company Details Funding Investors

ZeppelinHub [Seoul, 2011]: ZeppelinHub by NFlabs is a tool for enterprises to share insights and visualizations
created in Apache Zeppelin easily with a web URL instead of passing on PDFs or other attachments. This makes shared
content interactive, and updates to a report are pushed automatically so that everyone sees the latest version
of the report.

Informatica [Redwood City, 1993]: Informatica Corporation is a provider of data integration software. It has products
for ETL, Information Lifecycle Management, B2B Data Exchange, Cloud Data Integration, Complex Event Processing,
Data Masking, Data Quality, Data Replication, Data Virtualization, Master Data Management etc. It provides
multi-domain master data management solutions with data integration, data quality, and business-process management
capabilities embedded in the software. Connectors offered as part of its cloud-based integration include those for
Salesforce, SAP, NetSuite, Amazon Redshift and Microsoft Azure. It had revenue of $1.05B in 2014. According to
Gartner, Informatica's software revenue from MDM of customer data solutions in 2014 was $106.8 million.

Syncsort [Woodcliff Lake, 1968]: Syncsort provides technology for mainframe monitoring, data migration from
mainframes, and ETL and data integration for Hadoop. Syncsort's DMExpress earned the ETL World Record in
2008. DMX-h can dynamically split the data coming from mainframes and load it to HDFS in parallel. It is also certified
for Spark. By combining new unstructured data with legacy data, it enables enterprises to get more insights.

Coveo [Quebec City, 2005]: Coveo is an enterprise search technology that consolidates an organization's ecosystems of
record in real time, and provides unified search, dynamic 360-degree views of information, and contextual, proactive
recommendations of relevant content and experts using text analytics. It creates context-relevant information from
both structured and unstructured data and puts the most helpful case-resolving content in front of agents and
customers. It measures usage and adoption to optimize performance and offers connectors to more than 50 platforms.
Customers include L'Oreal, Lockheed Martin, Bombardier, CAE, YUM! Brands, GEICO and SunGard.
Funding: $69.7M. Investors: Access Capital, Telesystem, Tandem, iqventureadvisors.com, BDC, Fonds de solidarité FTQ.

Lucidworks [Redwood City, 2007]: Lucidworks provides a commercial distribution of Apache Solr, offering the
benefits of open-source Solr combined with an enterprise-grade toolset and support. In Sep 2014, the company
introduced Fusion, a new version of its search platform that gives companies the ability to gain insights from massive
amounts of unstructured data. It added advanced machine learning and signal processing capabilities to make
searching faster and easier. Its clients include Sears, Verizon, ADP, Raytheon, Qualcomm, Ford, MapR and Cisco.
Funding: $61M. Investors: Granite Ventures LLC, In, Walden International, Shasta Ventures, Allegis Capital.

Data Preparation (6/9)


Company Details Funding Investors
Recommind [San Francisco, 2000]: Recommind’s enterprise search and categorization platform automatically
organizes, manages, and distributes large volumes of information from multiple sources. With faster access to the
right information, organizations can save time, enhance the quality of work product, increase the value of information
assets, and improve competitiveness and profits. Recommind has been positioned by Gartner Inc. as a “Leader” in its
2015 Magic Quadrant for E-Discovery Software report for the fourth consecutive year.
Funding: $24M. Investors: Kennet, Silicon Valley Bank, Sapphire Ventures.

Sinequa [Paris, 2002]: Sinequa provides a real-time Big Data Search & Analytics platform for Fortune Global 2000
companies. It offers users Unified Information Access to all textual and database data, supported by powerful
analytics. Strong visualization enables intuitive and conversational discovery of actionable information, giving both
employees and customers real-time, intuitive, business-focused access to all relevant information wherever it
may reside. The company positions mastering information access as a major challenge within most industries today, and
presents Content Analytics and easy Unified Information Access as the keys to reducing stress, eliminating ineffective
decisions, and avoiding poor customer and employee management, and as essential to enhanced
productivity, innovation and collaboration, resulting in better business performance and customer loyalty.
Hundreds of thousands of people in more than 250 organizations rely on Sinequa’s tools to create search-
based applications and integrate intelligent Enterprise Search functionality into enterprise- and industry-specific
applications. Customers span banking, consulting, consumer products, government, media, telco, manufacturing
and retail. Sinequa provides customer-facing employees and call center agents with an instantaneous, 360-degree
view of all customer history and activity, improving customer service and satisfaction while reducing service
costs. Revealing implicit social networks of experts and expertise helps find the right people when staffing a project,
find internally available expertise on a given subject, avoid redundant research and development projects, accelerate
product development and shorten time to market.
Funding: $5.33M. Investors: XAnge Private Equity, Aurinvest.

Q-Sensei [Brooklyn, 2009]: Q-Sensei builds indexing technology that lets companies search their data. The platform
analyzes and processes both structured and unstructured data from any source, be it databases, document servers,
SharePoint, CRM or even Internet-based information or social media feeds such as Twitter and Facebook. Its
multi-dimensional search technology encompasses a search, discovery and analytics platform and search-based
applications to explore, control and leverage the wealth of data. Q-Sensei's enterprise platform was noted as a
"Trend-Setting Product of 2012" by KMWorld Magazine and the company was awarded an IT Innovations Award
(Innovationspreis-IT) at CeBIT in March 2012. It was recognized as an "Innovative Business Analytics Company to Watch
under $100M in 2011" by IDC and received the "2011 North American Enterprise Search New Product Innovation Award"
from Frost & Sullivan.
Funding: $1.18M. Investors: bmt.de, Verizon.

Data Preparation (7/9)


Company Details Funding Investors

GraphScope [Karlsruhe, 2014]: GraphScope is a smart data search engine for RDF graphs. It uses advanced algorithms
to interpret keyword queries and translates them internally into SPARQL queries.
Current solutions are provided for knowledge management in the life sciences, publishing and public sectors.

AuriQ [Pasadena, 1996]: AuriQ's Essentia is a cloud-based solution designed to simplify and accelerate the process of
managing and analyzing structured, semi-structured and unstructured data from a variety of sources. Designed from
the ground up to address common issues when working with Big, Complex and/or Dark data, Essentia combines
innovative software and data processing techniques with in-memory and parallel computing, to achieve performance
gains at every level of the data analysis workflow. It provides two solutions: Data preparation to cleanse, blend and
harmonize data from heterogeneous sources and data exploration to browse, sample and query directly from raw
data, in-place and as-is.
Lexmark [Lexington, 1990]: Lexmark creates enterprise search software and hardware and services that remove the
inefficiencies of information silos and disconnected processes such as documents, files, email and other types of
information being created at different locations. With Enterprise Search, users can rapidly access the information they
need and then take action such as completing a task, solving a problem or advancing a business process. And with
faceted navigation, conversational search, natural language support and other intuitive features, users don’t have to
enter perfect queries to get the right results. Its technology offers class-leading content discovery and output
technology that unlocks valuable content regardless of where it exists—repositories, SharePoint sites, email systems,
network shares, intranets, extranets, websites, databases, social media and all other places.

Kimola [Ankara, 2011]: Kimola is a big data company providing search, semantics and analytics services in the cloud.
The company started in the Microsoft Innovation Center in Ankara, Turkey. It provides enterprise search to analyze
unstructured CRM data such as flat files, email and conversations to give real-time insights.

Data Preparation (8/9)


Company Details Funding Investors
Constellio [Quebec City, 2006]: With Constellio’s Open Source Enterprise Search, users can find pertinent information
across all existing business applications. Constellio indexes all of an enterprise’s information sources, structured
or unstructured, behind a single interface. It provides a simplified interface for powerful search, including faceted
search, intelligent spell checking, customization of displayed results, configurable sorting of results, and saving
and sharing of search history, along with a security system designed to fully respect the legacy security of the
indexed systems.
Mindbreeze [Linz, 2005]: Mindbreeze GmbH is a provider of software products used for finding relevant
information in corporate data and on the Internet. Products include Mindbreeze InSite and Mindbreeze InSpire.
Mindbreeze InSite is a SaaS-based website search tool. Mindbreeze InSpire unites business facts from company-internal
data sources and from the Internet in one semantic search index. Mindbreeze appeared in the Gartner
Magic Quadrant for Enterprise Search in 2015 for the 8th time in a row. Siemens, Ober Bank, Dunlop and Futurezoe.at
are some of its customers. Prices vary depending on the number of documents to be searched and the number of queries.
TextWise [Rochester, 1994]: TextWise developed the first scalable, automated, semantic similarity search technology
enabling the web to move from matching keywords to a meaning-based foundation. TextWise’s semantic technology
would enable major search/content players to index, match and retrieve disparate content and enable other
applications that leverage the meaning of content. Indexing content with semantic descriptions enables any
application to both discern what the content is about and to provide highly relevant matching, concept tagging and
categorization. TextWise holds a significant patent portfolio in extraction, search, categorization and classification
using both NLP and statistics.

Expert System [Modena, 2000]: Expert System is the creator of Cogito, a patented semantic technology that
understands the meaning of written language using NLP techniques. The company designs task-specific products that
give businesses, enterprises and governments intelligence on any data they need. Its products include Cogito
Discover, Categorizer, Intelligence Platform, Risk Watcher and more.


Data Preparation (9/9)


Company Details Funding Investors
Sphinx Search [Shoreline, 2007]: Sphinx is a full-text search engine, distributed under GPL version 2. Commercial
license is also available for embedded use. Generally, it's a standalone search engine, meant to provide fast, size-
efficient and relevant fulltext search functions to other applications. Sphinx was specially designed to integrate well
with SQL databases and scripting languages. Currently built-in data sources support fetching data either via direct
connection to MySQL or PostgreSQL, or using XML pipe mechanism (a pipe to indexer in special XML-based format
which Sphinx recognizes).


Data Science Platform (1/5)


Company Details Funding Investors
Revolution Analytics [Palo Alto, 2007]: Revolution Analytics was founded in 2007 to support and foster the R language
community, as well as commercial R users. Its focus reflects the ongoing development of R from an open-source
academic research tool into commercial applications for industrial use. Through its Revolution R products, the
company makes predictive analytics accessible to a wide user base. It provides free and premium software and
services that bring productivity and ease of use to R, enabling statisticians and data scientists to derive insights
from critical data.
Funding: $32.2M. Investors: North Bridge, Intel Capital.

Continuum [Austin, 2011]: Continuum’s Python-based data analytics products and consulting services help
organizations analyse, manage and visualize big data. Its Binstar product is free for hosting public repositories of
binary packages, while the Enterprise edition allows private package repositories. Its Anaconda server is offered for
free, while the Enterprise edition comes with comprehensive support and training. Customers include LinkedIn, NASA,
Boeing and JPMorgan.
Funding: $27.75M. Investors: BuildGroup, Silicon Valley Bank, Defense Advanced Research Projects Agency, General Catalyst.

Dato [Seattle, 2013]: Dato (formerly GraphLab) is a machine learning platform for data scientists and developers.
Its main product, GraphLab Create, covers data cleaning, feature development, model training, and creating and
maintaining a predictive service, in a way designed to be easy to use, fast and powerful. It also offers Dato
Distributed (distributed execution of machine learning jobs on a cluster of machines in Hadoop, Spark, EC2 etc.) and
Dato Predictive Services (hosting machine learning models as REST-queryable services). Customers include Adobe,
Cisco, PayPal and Zillow, with applications such as item recommendation, fraud detection and sentiment analysis.
Funding: $25.25M. Investors: New Enterprise Associates, MADRONA, Vulcan Capital, Opus Capital.
Adatao [Mountain View, 2012]: Adatao's BI product pAnalytics, built on Apache Spark, is designed for data scientists
and engineers, who can work in familiar tools such as R, Python, SQL and Java. A business layer called pInsights lets
end users query the data using natural language. The system learns from the data which questions users are likely to
ask, and keeps learning from user queries to offer an as-you-type drop-down of likely queries, similar to the
suggestions shown when entering a term in Google search. The team includes PhDs and engineers with deep experience
from Google, Yahoo and other major enterprise players. Marc Andreessen is a Board Advisor.
Funding: $13M. Investors: Bloomberg Beta, Andreessen Horowitz, Lightspeed Venture Partners.


Data Science Platform (2/5)


Company Details Funding Investors
Dataiku [Paris, 2012]: Dataiku’s product Data Science Studio offers an end-to-end solution for data scientists to make
sense of big data. It provides an environment for connecting to various big data sources, cleaning the data,
processing it with algorithms, building models with machine learning, and gaining and visualizing insights. It also
has a data pipeline to automate the entire process. Depending on the dataset and the computation being applied,
Dataiku can process the data in-memory, as a stream, in-database or in Hadoop.
Funding: $3.6M. Investors: Alven Capital, Serena Capital.

Sense [California, 2012]: Sense offers a cloud platform for data science teams to collaboratively run scalable data
analytics and to schedule and deploy them to production. Data can be accessed from Spark, Hive, Impala, Presto,
Redshift, Hadoop etc. using R, Python, Julia etc. Sense can also be deployed on a private cloud or in on-premise data
centres.
Funding: $1.1M. Investors: Granite Ventures LLC, Illuminate.

Domino Data Lab [ , 2013]: Domino provides a platform for data scientists to do collaborative data analysis. It lets
data scientists use the language and IDE of their choice to work on data models and deploy them with command-line
tools onto its scalable cloud infrastructure built on AWS, or onto an on-premise cloud. Analyses and reports can be
easily converted to web forms etc. so that non-developers can run similar analyses.
Funding: $100k. Investors: GitHub, In-Q-Tel, Zetta Venture Partners.

Revelytix, Inc. [Sparks, 2008]: Revelytix develops Loom, a Hadoop data management tool that data scientists use to
work with data in Hadoop clusters. Loom is a web-based application that provides a workbench user interface for
working with disparate Hadoop-based datasets. It converts SQL into MapReduce jobs using an extensible workflow
paradigm, automatically updating the Loom Registry with workflow results. The company built data governance and
metadata management solutions for the Department of Defense for four years before pivoting to the commercial space
in 2012.
Datumtron [New York City, 2013]: Datumtron is an in-memory graph database API for .NET, based on The New Datum
Universe Model, that provides a platform for data mining and machine learning. The DatumTron API is a tool to store,
query and mine data. Data is represented as a directed acyclic graph of "datums" connected by "is" links. It provides
functions and operators to add and update data, query the data and deduce new data. A datum that has an object
associated with it is called a Katum. It has three major applications: bringing data from a traditional database
into memory, a data mining system, and an intelligent knowledge agent.


Data Science Platform (3/5)


Company Details Funding Investors

Data2B [Rennes, 2015]: DATA2B is a team of Big Data and Data Science experts that builds products based on machine
learning models. It has partnered with Cloudera, Dataiku, Bretagne Commerce International, Rennes Atalante, and images
reseaux.

DataCanvas [Seattle, 2014]: ZetData offers a cloud service (DataCanvas) to create, manage and share big data
analytic pipelines. The platform supports building big data processing pipelines both in the cloud and on premise. It
enables developers, data scientists and product analysts to collaborate across teams on analytic flows. It can be
used to create advanced analytics and visualization platforms.

Omniscience [Palo Alto, 2016]: Omniscience is a stealth-mode data science platform that is commercializing
distributed data-mining systems that came out of U.S. government intelligence and military research projects, to help
companies with brand perception, product safety and reliability, risk management, sales targeting and regulatory
affairs. The company combines internal and external data sources and extracts insights using its proprietary
algorithms. Its software features advanced analytics such as correlation clustering, collaborative filtering and
vector space analyses across 1 million+ dimensions to boost sales of apps, retail goods, streaming music and movies,
and movie and event tickets. It performs customer analytics for in-band, low-latency micro user-segmentation and
recommendations on extreme-volume, extreme-dimensionality data from purchase history, CRM data, social chatter,
sensors and location data. Use cases include customer conversations; its algorithms are also useful in drug
discovery, deep supply chain analysis, national security, IT, healthcare, high-end manufacturing, robotics and
self-driving vehicles.

Algonator [Denver City, 2015]: Algonator offers advanced analytics solutions based on a data science platform. It
focuses on creating self-learning AI systems that learn from data regardless of industry or type of business,
including e-commerce, financial services, ICT, retail and government.


Data Science Platform (4/5)


Company Details Funding Investors

Lumidatum [Bellevue, 2015]: Lumidatum provides a data science platform for building predictive models to optimize
merchandising, make product recommendations and identify the best customers. It covers the workflow from data
preparation and cleansing through model building, letting users launch and use predictions within their own apps and
platforms.

SAS [Cary, 1976]: SAS provides business analytics software and services, and is the largest independent vendor in the
business intelligence market for end-to-end BI solutions. It also provides seamless access to the Pig and Hive
languages and the MapReduce framework to explore and visualize data stored in Hadoop, discover patterns and
publish reports. It provides business views delivered through dashboards, visualizations, e-mail alerts, mobile
applications, self-service and visual data exploration tools. Gartner positioned SAS as a Leader in the Magic Quadrant
for BI and Analytics Platforms in 2015.
Yet Analytics [Baltimore, 2014]: The Yet Core is an analytics platform and scalable xAPI LRS database purpose-built
for the collection and analysis of human and machine performance data. Yet’s data-driven training platform for
companies is built on the Department of Defense-developed xAPI. The Yet Core standardizes job training data from
various sources, for the development of wearables, beacons, augmented reality and the Internet of Things. It helps
businesses and organizations gather data from across their systems in order to identify trends and patterns of
behavior among their employees and to use this people data to improve operational outcomes. The platform gathers
data from web and mobile sources as well as non-traditional sources such as wearables, Internet of Things devices
and sensors, and virtual and augmented reality systems.
Funding: $1.3M. Investors: A-Level Capital, Baltimore Angels, pantherangels.com.

Outlier [San Francisco, 2015]: Outlier is currently in stealth mode. It provides an AI-based BI solution and was
founded by the founder of Flurry, which provides a mobile analytics and advertising solution.
Funding: $1.2M.


Data Science Platform (5/5)


Company Details Funding Investors
Simularity [San Francisco, 2011]: Simularity provides real-time event prediction, predictive maintenance
(Condition Based Maintenance) and anomaly detection. It has proprietary methods to bring machine learning to the
edges of the network, making connected devices smarter. It models easy-to-understand predictive patterns in massive
amounts of disparate data through industry-specific algorithms based on its proprietary High Performance Correlation
Engine (HPCE). The HPCE is designed for massive scalability of correlation and similarity metrics.


Data Visualization (1/2)


Company Details Funding Investors
Birst Inc. [San Francisco, 2004]: Birst provides a SaaS-based Business Intelligence platform for enterprises. Its
applications are available both cloud-hosted and on-premise for data discovery, visualization and analytics on
multiple data sources. It brings data from different sources onto its platform so that various stakeholders can make
better business decisions. It offers self-service BI, reporting, analytics and dashboards. Birst customers include
American Express, Aruba, Cisco, Citrix and numerous others. Birst partners with more than 60 solution providers
across the Americas, EMEA and APAC. Technical partnerships include Amazon, NetSuite and Salesforce.com. As of 2015,
Birst was growing its revenues at a rate of between 80% and 100% annually.
Funding: $139M. Investors: Wellington, Northgate, Sequoia Capital, dag ventures, Hummer Winblad Venture Partners.

CartoDB [New York City, 2011]: CartoDB is an open-source, cloud-based mapping platform. Users can use the platform
hosted by Vizzuality or deploy their own instance of the open-source software. It enables users to perform analyses,
visualise data on maps and share results. Pricing ranges from a free tier to $299/month for individuals and
$7,188/year for enterprises. It has over 70,000 users, of which more than 800 are paying customers. Enterprise
customers include IBM, Real Madrid, Twitter, Wall Street Journal etc.
Funding: $31M. Investors: Accel Partners, Salesforce, Earlybird, Kibo Ventures.

Tableau [Seattle, 2003]: Tableau Software (NYSE:DATA) provides software applications for analytics and visualization.
Its products include Tableau Desktop, Tableau Server and Tableau Online for cloud deployments. Some of the products
can work with huge data sets live or in-memory, mash up data sources and visualize data in multiple ways. Users can
build dashboards and create interactive data applications. Well-known companies such as Playdom, Wells Fargo, Zynga
and eBay use Tableau Software’s business intelligence solutions.
Funding: $15M. Investors: New Enterprise Associates.

nflabs [Seoul, 2011]: NFLabs develops and distributes Apache Zeppelin, a web-based tool for interactive data
analytics: data ingestion, discovery, analytics and visualization. It can be used with many big data frameworks such
as Spark, Flink, Ignite, Tajo and Hive. Its interpreter mechanism allows more such engines and frameworks to be
integrated with Zeppelin. Visualizations and insights can be shared by URL and broadcast in real time. It also
provides a manager to easily install and manage Zeppelin.
Funding: $1.5M. Investors: Big Basin Capital, Coolidge Corner Investment, Bonangels.

pixlcloud [San Francisco, 2009]: Pixlcloud provides a data visualization application for cyber security. It helps
users understand unfamiliar data and identify anomalies, misconfigurations, outliers, trends and relationships.


Data Visualization (2/2)


Company Details Funding Investors

Metricforce [Mountain View, 2015]: Metricforce claims to be an AI-enabled, mobile-first BI technology that uses big
data to help manage business metrics from one place. The founder previously worked at IBM and VMware.


Search Based Analytics


Company Details Funding Investors
ThoughtSpot [Palo Alto, 2012]: ThoughtSpot is built to be used with zero training to ask questions, analyze company
data, and build reports and dashboards - all in seconds - using a browser-based search interface. ThoughtSpot
combines data from data warehouses, Hadoop, on-premise and cloud apps, and spreadsheets, scales to billions of
rows, and deploys in under an hour using in-memory technology. The company's founding team has previously built
market-defining search and analytics technologies at Google, Amazon, Oracle and Microsoft. Some of its customers
include Rambus, Sterlingback Check, Forrester Consulting and more.
Funding: $90.7M. Investors: Khosla Ventures, Lightspeed Venture Partners, General Catalyst.
Endeca [Cambridge, 1999]: Endeca Technologies provides big data analytics, web commerce, and business
intelligence solutions. Its core technology, the MDEX engine, enables enterprises to store, manage, search and analyze
unstructured data. Endeca InFront is a customer experience management platform that enables customer experience
delivery with advanced merchandising and content targeting tools for web commerce. Endeca Latitude is a platform
for developing analytics applications that combine structured and unstructured data. The technology doesn't
rely on algorithms to detect patterns but enables a "dialogue" between the user and the data. Endeca was acquired by
Oracle in 2011, and Oracle launched Oracle Endeca Information Discovery for big data analytics.
Funding: $76.17M. Investors: Silicon Valley Bank, Venrock, Sapphire Ventures, Bessemer Venture Partners, Intel Capital, GGV Capital, N Capital.

DataRPM [Fairfax, 2012]: DataRPM provides big data analytics tools with a natural language interface. The system
automatically translates natural-language queries into SQL queries and generates visualizations from the
retrieved data. It can be used from the cloud or installed on-site. Pricing starts at about $100,000 yearly.
Recognized as One of 10 Startups Destined to Break Out in 2014 (Tech.co), Top 10 Big Data Analytics (Enterprise Apps
Today), Best New Big Data Solution (American Business Awards) and “Cool Vendor” (Gartner) in 2014.
Funding: $5.9M. Investors: Interwest, Center for Innovative Technology.

TripleHop Technologies [New York City, 1999]: TripleHop Technologies provides MatchPoint, a context-sensitive
enterprise search product. It combines semantic and statistical analysis to improve user queries. It provides
real-time incremental indexing of over 200 document formats including MS Office, PDF and TXT files. It supports exact
phrase searching, wild cards, proximity, parentheses, Boolean search, stemming and fuzzy searches. Oracle
integrated MatchPoint technologies with Oracle search offerings.

Xurmo [Bangalore, 2009]: Xurmo develops a Hadoop-based DBMS for machine-guided analytics. Xurmo is a fully
packaged, content-aware platform that can consume any type of data, at any scale and at any velocity. Xurmo's
patented architecture supports search-guided querying and allows users to analyse data with no schema design.


Streaming Data Analytics (1/6)


Company Details Funding Investors
Qubole [Mountain View, 2011]: Qubole offers big data technologies such as Hadoop, Spark, Hive, Pig and Presto as a
service on AWS, Google and Microsoft cloud platforms through its product, Qubole Data Service. Features include
autoscaling, elastic pricing and cluster management. It offers a unified interface for connecting to various data
sources and querying them on the technology and infrastructure of choice. It can be used for ad-hoc querying, BI
workloads and application workloads. Qubole created StreamX, an open-source service that ingests real-time data from
Kafka and persists it to cloud object stores such as Amazon S3. It has won several awards, including CNBC Disruptor50
and CRN Emerging Vendors in 2015.
Funding: $50M. Investors: Charles River VC, Institutional Venture Partners, Norwest Venture Partners, Lightspeed Venture Partners.
StreamBase [Lexington, 2003]: StreamBase Systems provides software for rapidly building systems that analyze and
act on real-time streaming data for instantaneous decision-making. StreamBase’s Event Processing Platform combines
a rapid application development environment, a low-latency high-throughput event server, and the broadest
connectivity to real-time and historical data. Investment banks, hedge funds, and government agencies use
StreamBase to power applications that increase revenue, lower costs, and reduce risk. The company is
headquartered in Waltham, Massachusetts.
Funding: $37M. Investors: Accel Partners, Horizon Technology Finance Management LLC, Bessemer Venture Partners, In, Battery, Highland Capital Partners.

Intersec [Paris, 2004]: Intersec, founded in 2004, is a data analytics solution provider focused on providing
insights on streaming data for telecom operators. Its strength is in detecting fraud and taking corrective
action. To achieve this, its solution is built on HDFS and other databases, combined with in-memory processing for
fast execution. Over time Intersec has won big-name customers such as Zain, Orange, SFR, Etisalat, O2, MTS and other
telecom operators.
Funding: $30M. Investors: Cartagena Capital, CM-CIC Investissement, Highland Europe, Omnes Capital, Cisco, Harbert, Innovacom.

SpaceCurve [Seattle, 2009]: SpaceCurve is a real-time big data platform designed to deliver immediately actionable
intelligence applications and services. The SpaceCurve platform concurrently ingests, fuses and analyses historical
and streaming data from satellites, sensors, weather, Internet of Things, Industrial Internet and other sources at
scale to provide organizations with insights. It is designed to handle diverse streaming machine-scale data
sources that continuously generate data at rates of millions or billions of records per second.
Funding: $29.8M. Investors: Triage Ventures, Reed Elsevier Ventures, Divergent.

Ryft Systems [Rockville Centre, 2008]: Ryft provides a big data analytics platform for real-time streaming via ODBC
to BI & visualization tools. It supports high-level programming languages (C/C++, Java, R, Python and others). With
the Ryft ONE, users can analyze and act on both batch and streaming data in real time, instead of having to wait for
data to be batched and indexed. Data from many sources and formats (batch, streaming, structured, unstructured)
can be mined more deeply.
Funding: $19.95M. Investors: Grotech, Razor.

Streaming Data Analytics (2/6)


Company Details Funding Investors
Bottlenose [San Francisco, 2010]: Bottlenose helps enterprises identify, anticipate and instigate the trends that
drive their businesses in real time. Its patent-pending Trend Intelligence engine, StreamSense, detects patterns in
real-time streams of data. It finds trends in any kind of unstructured or structured data: social streams, breaking
news, broadcast media, sales data, stock market data, enterprise data, etc. Using a new real-time big-data analytics
technology (29 pending patents), it is capable of continuously analyzing hundreds of billions to trillions of
changing data points in real time, offering actionable insights for marketing, sales, support, competitive
intelligence and strategy. Nerve Center is geared towards marketers, but that’s just the beginning.
Funding: $18M. Investors: Transmedia Capital, KPMG, Advancit Capital, SocialStarts, ff Venture Capital, Fenox Venture Capital, Stage One Capital, Prosper, Lerer Hippeau Ventures.

Acunu [London, 2009]: Acunu Analytics offers a platform for low-latency, continuous analytics on big data, powering
dashboards and embedded applications to monitor and control environments in industries where high-velocity data
must be analysed in real time. The Cassandra NoSQL database is at the core of the Acunu offering. Acunu was
acquired by Apple in late 2013.
Funding: $10.12M. Investors: Pentech Ventures, Oxford Technology, Eden Ventures, Imperial Innovations.

Truviso [Foster City, 2005]: Truviso data analytics software gives network-driven businesses real-time insight into
their operations through continuous analysis of live production data. These customers have traditional data warehouse
and business intelligence solutions that cannot deliver actionable information from their production systems quickly
enough, or cannot scale efficiently with rapidly increasing volumes of data. By combining current and historical data
in real time, Truviso customers gain accurate visibility and actionable information for their marketing, sales and
operations teams, as well as for their partners and customers. The technology is used by ad networks, enterprise
software vendors and e-commerce websites, among other users. Truviso also offers custom-branded analytics dashboards.
Funding: $3M. Investors: Diamondhead Ventures, United Parcel Service, ONSET Ventures.


Streaming Data Analytics (3/6)


Company Details Funding Investors
Activity Stream [Reykjavik, 2013]: Activity Stream makes it easy for companies to collect data from multiple sources
without disrupting current infrastructure, and is especially relevant for companies that use multiple cloud services
or SaaS solutions. It contextualizes content in real time and turns underutilized business data into actionable
intelligence and insights. It does so by observing the actions and activity taking place across an organization’s
business systems to construct a complete, synthesized picture of everything happening with business relevance. With
this complete view and full context, Activity Stream can recognize patterns, opportunities and threats as they
present themselves. It then provides right-time intelligence to appropriate stakeholders so that these observations
can be proactively dealt with or taken advantage of. Activity Stream is an Operational Intelligence Layer that can be
implemented unobtrusively on top of current IT infrastructure, without having to rip and replace existing systems.
Its stated mission is to help customers gain or retain competitive advantage by making advanced technology available
in a way that enhances their current IT infrastructure, without disruption or operational risk.
Funding: $2M. Investors: eyrirsprotar.is, SEED Capital, StartupReykjavik, Frumtak.
Gridsum [Beijing, 2005]: Gridsum provides web, video and search analytics solutions based on data warehouse
technology for multinational and domestic enterprises and government agencies in China. Its flagship product,
Gridsum Big Data Platform, performs multi-dimensional correlation analysis and analyzes real-time events with its
machine learning capability. Solutions are offered as SaaS. It provides multiple analytics tools, called dissectors,
for real-time analysis, covering web, video, streaming, mobile, TV, ad, data, law, media and more, and builds
solutions by combining dissectors. Solutions offered are Marketing Automation Suite, E-Government Suite, New Media
Suite, Information Discovery Suite and Visualization Suite.
Investors: Steamboat Ventures, Nokia Growth Partners.
Medium One [San Francisco, 2014]: Medium One is a cloud-based workflow builder that allows developers to create
customized real-time workflows. It provides an API through which data is ingested, which in turn triggers the
workflow. Workflows are written in Python using built-in libraries. It also provides an integrated NoSQL analytics
datastore that stores JSON documents used by workflows. Workflows are processed on each event, enabling stream
processing. Dashboards are provided to visualize event data, and third-party integration is available. It has
partnered with Samsung, Renesas Electronics Corporation, Cyber Group and Micrium.

Cynepia [Bangalore, 2013]: Cynepia is an enterprise analytics and big data startup, currently in stealth mode. It
works with data in real time and makes data accessible through channels such as social and mobile.


Streaming Data Analytics (4/6)


Company Details Funding Investors
Apama [Cambridge, 1999]: Apama provides a streaming predictive analytics platform. It combines complex event
processing, messaging, in-memory data management and visualization, and allows analysis of high-volume business
operations data in real time. It also develops end-user business dashboards for monitoring, and supports the MQTT
and AMQP protocols for integration with IoT. It can be deployed on a local machine, on a server or in the cloud.
Apama was acquired by Progress Software in 2005; Software AG bought Apama from Progress Software in 2013.

Intelie [Rio de Janeiro, 2008]: Intelie provides a real-time stream analytics platform, Intelie Live, offered as a cloud or
on-premise solution. Data integrated from multiple sources such as transactional data, machine data, and sensors is
processed through Intelie Pipes, an in-memory real-time processing engine. Processed data is used for visualization
via dashboards and for alert monitoring using predictive analytics. The platform is extensible with new
integrations and new query languages. Named a Cool Vendor by Gartner in 2015. Received first place in the 2013 TOTVS
Start it Up. Clients include Vale, Walmart and globo.com, among others.

Cetas [Palo Alto, 2010]: Cetas provides real-time predictive analytics. Cetas facilitates real-time and ad-hoc data
analysis and sophisticated pattern extraction. Helps business users gain insights and visibility into their
customer/audience behavior. Was incubated at Clearstone Venture Partners.

jKool [Melville, 2014]: jKoolCloud offers a solution for streaming analytics of Big Data. Time series data can be sent to
the jKool Cloud Compute Grid store to track and analyze streams in memory and in real time. The data can be queried
and viewed with jKool Elastic Views and jKQL. It is built on top of DataStax Enterprise (Apache Cassandra). It uses a
distributed Big Data Repository, an Elastic Grid and a Query Grid. The underlying NoSQL repository is optimized for
time-series data and delivers transparent scale, performance and storage. It is a spinoff of Nastel Technologies.

joojip.com [Bangalore, 2015]: Joojip is a streaming data analytics platform for building real-time and IoT applications.
With Joojip, users can embed real-time analytics into cloud applications either by creating new applications from
the ground up or by enriching existing applications with its analytics widgets. Its end-to-end stream processing allows
users to collect, analyze and visualize real-time data.


Streaming Data Analytics (5/6)


Company Details Funding Investors

Iris Analytics [Frankfurt, 2000]: IRIS Analytics powers real-time fraud prevention and scoring in all forms of electronic
payment. It claims response times below five milliseconds, ensuring 100 percent in-flight real-time decisions
for volumes of up to 10,000 transactions per second. Its clients include banks, card processors and payment service
providers across Europe and North America.

QuickLogix [ , 2013]: QuickLogix enables businesses to make decisions by combining the power of big data with the
prescience of experience and expertise. Its instant-analyst product, Genie, offers an easy-to-use natural-language
query interface and a dynamic UI visualization portfolio, with an intelligent big-data mash-up engine powered by
statistical sciences to obtain deeper business insights.

AKUDA [San Jose, 2007]: AKUDA LABS offers solutions to process real-time, streaming big data for filtering,
classification, analytics and model building on structured, unstructured and semi-structured data. The company has a
proprietary Pulsar hypercomputing platform that serves as a real-time stream classification engine. It also allows analysts to
use pre-built models from a marketplace to analyze the data. It can also integrate with other BI tools to enable a wider
set of features.

Mentat Innovations [London, 2012]: Mentat is developing a big data & machine learning based intelligence platform
at IoT scale. Provides a dashboard for real-time analytics and statistical modelling on streaming data. Solutions are targeted
at unsupervised IoT machine learning, global real-time intelligence and anomaly detection for large enterprise
networks & IoT. Its cybersecurity solution flags malicious activity against a background of shifting legitimate
behaviour within the enterprise firewall. Developed an application for tracking & analysing the real-time efficiency &
predictive maintenance of wind turbines. Its proprietary streaming machine learning technology is deployed on big data
platforms like IBM InfoSphere Streams, AWS Kinesis and Apache Storm/Spark. Member of the European Cisco
Entrepreneur in Residence program and the Cylon Labs Cybersecurity program in 2015.


Streaming Data Analytics (6/6)


Company Details Funding Investors

StreamAnalytix [Los Gatos, ]: StreamAnalytix is a platform that enables enterprises to analyze and respond to events
in real-time at Big Data scale. It is designed to rapidly build and deploy streaming analytics applications for any
industry vertical, any data format, and any use case.


Verticals->Finance (1/19)
Company Details Funding Investors
BBD [Chengdu, 2013]: BBD, aka Business Big Data, provides Big Data solutions for the finance industry. Provides two
solutions, HIGGS Kunlun and HIGGS Galaxy. HIGGS Kunlun provides a platform for building applications that allow
business users to draw meaningful insights from unstructured financial data gathered from different sources in order to analyze
credit risk. Different organizations can perform secure, co-operative analysis over the same data. HIGGS
Kunlun provides a series of integrated analysis tools on its front end, including semantic, time series, geographic
location and textual analyses. HIGGS Galaxy is suitable for large-scale quantitative investigation. Tracks data on
insurance claims, network trajectory and financial transaction modes. Provides a visualization platform to analyze
data in forms, charts and diagrams. Has offices in Beijing, Shanghai, Hong Kong, Shenzhen, and Hangzhou, as well as a
branch in Singapore to serve overseas clients. Clients include PricewaterhouseCoopers, KPMG, peace
crowdfunding, Xinhua News Agency, "Fortune" (Chinese edition) and Sichuan football clubs.
Funding: $30.65M. Investors: CDH Investments, sanshenggroup.net, Goldman Sachs.
Scaled Risk [Paris, 2012]: Scaled Risk provides a real-time in-memory big data analytics platform for the financial
industry. The platform delivers real-time analytics on historical and live trade data, helping investment firms achieve
real-time risk management and comply with regulatory demands. Brings real-time and transactional integrity to
Hadoop and HBase. The platform provides a suite of capabilities to integrate, secure, and analyze data. Its search
capability gives access to the organization's whole data set, such as market data, trades, positions, reports,
contractual documents and Excel sheets. OLAP cubes can be created to perform visual analysis on the data.

Datavore Labs [New York City, 2014]: Datavore is a data analytics platform for financial services. It uses a machine
learning model for financial data aggregation and supplements a finance pro's intuition. The team has experience
working at companies such as Goldman Sachs, Thomson Reuters, Ernst & Young, and KPMG.

EidoSearch [Toronto, 2010]: EidoSearch is a search tool that allows financial professionals to generate new trading ideas,
analyze macro-economic trends, and construct portfolios by back-testing a universe of securities. It applies advanced
information processing techniques to make the world's time series data searchable. Its customers include some of the
largest financial firms in the world, which use its software as a search and discovery tool to help their traders, analysts
and portfolio managers make better-informed investment decisions. It was recognized as the top new start-up
to watch by Backbone Magazine's Alpha Exchange Innovation Campaign in 2012, and was a winner of the 2012 FinTech
Innovation Lab in New York City.


Verticals->Finance (2/19)
Company Details Funding Investors
Big Data Scoring [Tallinn, 2012]: Big Data Scoring is a European credit scoring company that develops generic and
tailored credit score models based on big data and social networks. Its big data credit score model for consumer
credit uses information from the Facebook social network and other sources. Big Data Scoring offers a proven,
generic social media scorecard as a service to creditors around Europe. It works with lenders of any kind: banks
and non-bank lenders, payday and P2P lenders, microfinance providers and leasing companies.

Uptake [Palo Alto, 2014]: Uptake creates customized analytical solutions for companies by collaborating with them.
While building the tailored platform for partners they gain access to large datasets and domain knowledge which is
then applied across other industries to build and improve the products.
Funding: $45M. Investors: New Enterprise Associates, Lightbank, Caterpillar, General Purpose Vehicles.

Maana Inc [Palo Alto, 2012]: Maana is a search & discovery engine for big data on Hadoop. Maana organizes the
data on Hadoop for discovery, recommendation and personalization through search. Enterprises use Maana to expose
their big data on Hadoop to people in their ecosystems: employees, suppliers, partners, customers and/or
consumers. Maana is not based on the open-source text-search library Lucene. Maana optimizes the assets and
business processes of large industrial companies and oil and gas companies.
Funding: $40.15M. Investors: Frost Data Capital, Shell, Intel Capital, ConocoPhillips, Chevron, GE Ventures,
Saudi Aramco Energy Ventures.

Striim [Palo Alto, 2012]: Striim is an end-to-end streaming data integration and operational intelligence platform
enabling continuous processing and streaming analytics. It has a flow designer which is a visual pipeline builder and
distributed deployment manager to create and monitor streaming applications. It also has interactive dashboards for
data exploration powered by push-based, ad-hoc SQL queries on fast, streaming data. It is built using patent-pending
proprietary technology.
Funding: $31M. Investors: Summit Partners, Intel Capital, Panorama Point.

Sight Machine [San Francisco, 2012]: Sight Machine continuously analyses images from industrial cameras, data from
sensors, and information from factory systems to improve quality and operations. The analysis is performed for
quality and trackability while in operation. Sight Machine's web-based applications enable companies to exchange
quality information in real time through a web browser.
Funding: $24.5M. Investors: eLab Ventures, Jump Capital, Mercury Fund, Huron River Ventures, Orfin Ventures,
O'Reilly AlphaTech Ventures, Pritzker Group, A Ventures, Two Roads Group, FundersClub, GE Ventures.


Verticals->Finance (3/19)
Company Details Funding Investors

Automsoft [Ireland, 1997]: Automsoft is a big data based predictive analytics technology for collecting data from
industrial equipment in sectors such as oil and gas, power and utilities, and smart grids. Its core product, RAPID, collects, stores and
analyses data to enable critical decision making for its customers. It has thousands of installations in Utilities, Life
Sciences, Oil and Gas, Food and Beverage, Mining, and Pulp and Paper.
Funding: $10.6M. Investors: IDG Ventures, Pentech Ventures, Optimum Asset Management, Cross Atlantic Capital Partners.

Seeq [Seattle, 2013]: Seeq offers big data analysis solutions that help analyze and understand industrial process data
(IPD) better and faster than standard solutions. Seeq's features include reduced time for analysis, easier relationship
discovery, integration with ERP and other systems, support for business intelligence (BI) software such as Excel, Tableau,
SAS, and MATLAB, and collaboration support. Raised $2M in debt financing on May 30, 2013.
Funding: $8M. Investors: Second Avenue Partners, MADRONA.

Trend Miner [Limburg an der Lahn, 2008]: TrendMiner (a product of Dsquare) provides operational intelligence
through big data search and predictive analytics for industrial process data. TrendMiner brings a next-generation trend
client to the process and manufacturing industry. Through advanced pattern recognition algorithms, it can search the
entire plant information data history, capture knowledge for future reference, and monitor the live data to give users
early warning in case of abnormal behavior or batch run anomalies.
Funding: $5.51M. Investors: http://www.lrm.be/home/en.

RtTech [Moncton, 2011]: RtTech specializes in real-time machine monitoring that turns industrial data into
information that can be used to reduce inefficiencies. Products are built using the OSIsoft PI System and can connect
to any control infrastructure that users have on the plant floor. RtTech currently has two products: RtDuet, for
automatic Downtime Monitoring and Asset Performance Tracking, and RtEMIS, an Industrial Energy Management
Information System that identifies areas within the plant where energy is being wasted.
Funding: $3M. Investors: New Brunswick Innovation Foundation, McRock Capital.

Ndustrial.io [Durham, 2015]: Monitoring and Analytics PaaS for production normalized operations. Founding team is
from North Carolina State University. Team raised investment of $1.4M from SF based PE firm Bay Grove Capital.
Funding: $1.4M. Investors: bay-grove.com, Acorn Innovestments.


Verticals->Finance (4/19)
Company Details Funding Investors
Beet Analytics Technology [Plymouth, 2011]: Beet Analytics provides diagnostic and analytical tools that accelerate
problem solving in complex manufacturing and automation operations. Its software improves problem identification and
reduces production downtime. Its flagship product, Envision, is predictive maintenance software. Envision
enables users to digitize all manufacturing operations and monitor them anywhere in real time. It ensures that all
assembly processes are operating within acceptable parameters and compiles data & presents it in an EKG-style graph,
making problem identification easy. Beet currently offers two versions of Envision: System Scope (for a single
machine) and Factory Smart (for a whole line or entire factory). Beet is the Automation Alley award winner for 2014.
Funding: $1.25M. Investors: Macomb OU-INCubator.
COVACSIS [Mumbai, 2009]: Internet of Things solution for the manufacturing industry, providing real-time
manufacturing process diagnostics. Its Intelligent Plant Framework captures all micro events across all locations on the
plant floor and models them into key, highly relevant business KPIs. Its proprietary tool, RTPD, measures the
economics of the production floor of any industry. After a preliminary check on the various revenue leaks on a
production floor, RTPD provides information about capital gained or lost due to various operations on the
floor. Has a target of reaching Rs 100 crore in revenues by FY17. In the current financial year, the startup expects
contract bookings of Rs 25 crore. Current customers include Sun Pharma, Welspun, Godrej and Trident Group.
Funding: $460K. Investors: India Venture Partners, Reliance Industries, Cisco, Blume Ventures.

Visual Action [Buffalo, 2015]: Visual Action develops visualization-based products. It provides a platform for data
discovery and analytics. Offers the Flaremap applications suite for performance management, anomaly detection, asset
management, and risk compliance. The suite consists of Flaremap Studio for designing Flaremap
applications, Flaremap JS for cross-platform deployment, Flaremap Rich for customized deployments with rich
controls, and Flaremap Thin to extend treemap applications to mobile users. Partnered with MapR and HortonWorks.
Funding: $160k.

Arcstone [Singapore, 2013]: Arcstone Operations Platform is a SaaS platform to maximize productivity, reduce
production costs and improve product and service quality. Data is transformed into real-time monitoring and
control, decision support and process optimization for C-suite management, supervisors and managers on down to
line workers and shop floor personnel. Claims a client base ranging from Fortune Global 500 companies to
domestic SMEs in Singapore.
Funding: Undisclosed. Investors: 500 Startups, Wavemaker, ysscapital.com, Global Brain.


Verticals->Finance (5/19)
Company Details Funding Investors

MachineMetrics [Northampton, 2014]: MachineMetrics is a machine monitoring solution for manufacturing that
collects and visualizes data from machines to improve production performance. It provides a real-time dashboard
allowing operators and managers to keep tabs on production at all times and predict failures before they occur.
Customers include VSS Inc, Massachusetts.
Investors: Long River Ventures.

Raven Telemetry [Ottawa, 2013]: Raven provides machine data analytics. It connects to any machine and collects
critical performance data including uptime, error counts and codes. Its universal hub aggregates data and transfers it to
its cloud server. Smart algorithms translate data into actionable insights, and its real-time dashboard provides clear
metrics and actions for supervisors and operators.

Dattus [Indianapolis, 2013]: Dattus, formerly known as Bearing Analytics, provides asset monitoring and failure
prediction solutions for rotating machinery and other industrial applications. Provides sensor technologies and a
cloud infrastructure for sensor data management, benchmarking, and big data analytics. Developed a sensor based on
its patent-pending bearing sensor technology that allows multi-parameter monitoring (temperature, vibration, and
lubricant quality) using a single sensing element on the bearing cage, enabling unprecedented measurement accuracy
and an instantaneous response to failure-causing conditions. Received a $100k grant from Founder.org, a San
Francisco-based global student entrepreneur investor and company-building program.
Braincube [Clermont-Ferrand, 2007]: Braincube, earlier known as IP Leanware, provides a performance management
platform for manufacturing companies. Collects raw production data from different systems and links it by applying
time offsets. Data analysis is performed on current data by comparing it with standards set on the basis of historical
performance. Performance can be monitored in real time by creating and monitoring performance indicators on the
dashboard. Partnered with Cap Gemini and Engie, who propose the solution as the Manufacturing Intelligence
standard. Has offices in France, the US and Brazil.


Verticals->Finance (6/19)
Company Details Funding Investors
TransVoyant [Washington D.C., 1995]: TransVoyant streams and analyzes the world's live data, enabling decision
makers to get in front of global risks and opportunities. It delivers real-time intelligent decisions to global customers.
Employs a team of cleared analysts, consultants and technologists to assist clients in designing, implementing and
managing solutions for outcome-focused intelligence. TransVoyant solutions are used by business and government
customers to make real-time decisions with high-velocity live location, time, context and preference information. Its
decision and predictive analytics technologies, combined with its long history of data collection and on-time,
on-budget solution delivery, improve outcomes for intelligence, defense, supply chain and risk management customers.
Canary Labs [Martinsburg, 1996]: Canary Labs provides real-time enterprise historian and trending solutions that
simplify and optimize data analysis, driving more informed, confident decisions. Its open, flexible and high-performance
software improves process metrics and increases the agility, efficiency and reliability of data. Its
platform collects, stores, and displays critical data for advanced trending, analysis and reporting. Its applications
include the Canary Enterprise Historian, Canary Trend Link, Canary InfoLink and the Canary Smart Meter Solution. It
has a client base with over 10,000 installations in 26 countries. Canary Labs solutions are used in the energy, water
and wastewater, manufacturing, mining and metals, facility and data center, fiber optic and communications, and
process industries.
Analytika [Boston, 1989]: Analytika (part of Cimetrics) provides a big data analytics platform to
transform building and manufacturing process management. Analytika for Buildings (AFB) provides energy savings,
sustainability and tenant comfort to energy, facility and sustainability managers in the Pharmaceuticals & Life
Sciences, Higher Education, Healthcare and Government market sectors. Analytika for Process (AFP) serves the needs
of manufacturing and quality leaders and delivers improved process reliability, quality and throughput, while reducing
the risk of catastrophic failures and regulatory compliance issues. For industry it leads to consistently higher-quality
yields, less downtime, and greater operational efficiency.

Oden Technologies [London, 2014]: Oden Technologies is an Industrial IoT company developing both hardware and
software solutions for factories. Oden's hardware goes into existing machinery, enabling collection of
real-time data from the production line. Information is then relayed to its cloud-based analytics platform, where a
visualisation of the process flows can be observed and analysed in order to find distortions or glitches to address.


Verticals->Finance (7/19)
Company Details Funding Investors
DecisionIQ [Atlanta, 2014]: DecisionIQ is a data-driven platform and decision support system for predictive
maintenance and asset optimization. It curates all data types, from equipment sensors to logs and documentation.
Offers services such as operational maintenance risk, cost optimisation, logistical support, anomaly detection and
predictive failure. Its primary application is predictive analytics in the IT, transportation, energy, manufacturing
and industrial sectors.

IntelliSense.io [Cambridge, ]: IntelliSense.io deploys sensors, software and services to improve efficiency in the
commercial and industrial sectors. Uses its Brains Infrastructure, which consists of Intelligent Sensors and a Wireless
Sensor Network, along with its Brains App, a web-scale analytics applications platform. Its solutions are being deployed
in manufacturing, food & beverage factories, and oil & gas industries.

Sightline Innovation [ , 2012]: Sightline is a machine and deep learning cloud services company that specializes in
advanced quality inspection and data analytics, serving multiple verticals from healthcare to manufacturing.
It uses the Sightline Perception Engine (SPE) to serve the immediate needs of industry. Sightline Innovation has been
recognised as one of the top 20 most innovative companies in Canada by CIX.

EroNKan [Bangalore, 2014]: EroNKan is a cloud based data driven platform which collects data from the machines
and performs analytics in order to provide insights into the efficiency of the process, quality of the production, volume
of production and others. It alerts the employees on the parameters critical to the organization through SMS or email.
Provides real time analytics and KPI dashboards on the machine data. The platform can be customized based on the
requirements and also offers an on-premise platform.

Sense4Things [Dubai, 2013]: Sense4things is a Dubai based company developing solutions for machine, asset
utilization, workflow & employee productivity, supply chain & logistics. Sense4things also resells Thingworx industrial
IoT platform by PTC.


Verticals->Finance (8/19)
Company Details Funding Investors
1-enterprise [Bangalore, 2013]: 1-enterprise offers big data analytics solutions for manufacturing businesses as well
as IoT solutions. Duuramatics is a proprietary solution that enables manufacturing organizations to collect data from
their installed machinery and analyse it in real time. Planning, cost & finance, inventory & sales data are gathered
and brought together using custom-built adapters into SAP HANA, enabling analysis and execution of complex
algorithms with sub-second response times. Dashboards and reports provide information to users on a web portal or
handheld devices. 1-enterprise also offers AC Monkey, an IoT application for remote management of commercial air
conditioning systems.
Sabisu [London, 2010]: Sabisu is a data aggregation and predictive behaviour platform for oil & gas, manufacturing
and chemicals. It provides the real-time analytics, visualisation and collaboration capabilities needed for decision
support in oil & gas and petrochemicals. It uses industrial data to provide decision support through MS Excel based
reporting and a C/C++ based calculations engine that can perform anything from simple linear calculations to
complex n-dimensional analytics. Web integration services are available through APIs to OPC, ODBC/OLEDB,
SQLPlus, NoSQL, SharePoint etc. Customers include petrochemical companies like Sabic.

MAJiK [Kitchener, 2013]: By securely connecting directly to CNC-controlled machines, ERP systems, and any other
data source, MAJiK Connect provides near real-time monitoring and performance analytics for keeping production on
schedule. The platform enables users to react to machine downtime issues more quickly and identify which issues are
costing the most. Provides comparative analysis to show whether actual production is meeting scheduled production,
quantifies machine and factory performance, and develops customized dashboards suited to a factory's needs.

DATAmaestro [Belgium, 2002]: DATAmaestro is a product of the company Pepite, which provides a cloud-based
predictive analytics tool for heavy industry. DATAmaestro helps optimize machine utilization, downtime,
monitoring and maintenance to improve operational efficiency. Its solution includes historical data analysis, data
validation, and data mining. Clients include Jindal, Burgo, Rosier group, NLMK, Irving, Prayor etc.

Predikto [Atlanta, 2012]: Predikto provides predictive analytics to improve visibility and equipment reliability of rail,
aviation, and industrial fleets. Its solutions include device visualization, asset utilization, regulatory and safety
compliance etc. Its machine learning SaaS engine monitors equipment condition and sensor data to provide
warnings of abnormal equipment health. The company has been mentioned in IoT Evolution, ABI Research, InnoTrans
etc.

Verticals->Finance (9/19)
Company Details Funding Investors

Element Analytics [San Francisco, 2014]: Element Analytics develops industry-specific solutions for heavy industry
applications. Data scientists can use the platform to explore and analyze industrial data, with contextual
influences incorporated automatically. The product can be used by customers in the Power Generation, Manufacturing,
Transmission Distribution, Mining, Utilities, Chemicals, and Facilities sectors.

Samkhya Technologies [Kharagpur, 2014]: Its product offering, MAT, is an analytics tool for the mining industry. It
collects data from various stages of mining through GPS, software packages, and mobile and web based apps. Users
interact with the data through a variety of integrated applications built on top of the platform. Generates rich
visualizations: tables, scatter plots, charts and monthly reports. Can be integrated into existing data-generating
software like SAP/ERP. Also offers operation-wise module solutions to its customers. Looking for a CTO as of Jan 2015.

Unitary [Mohali, 2015]: Unitary provides an analytics platform for industries to track the equipment condition and
manage operational efficiency. Offers equipment performance management through sensors, predictive maintenance
and workflow management. Features include integrated dashboard for managing operations, real time data
collection, report generation and historical records of the equipment on the failures and action taken over the time.

Senseye [Southampton, 2015]: Senseye has developed a predictive analytics engine to analyze any kind of sensor data
generated by industries. Senseye applies machine learning, semantic annotation and predictive analytics
algorithms to sensor data combined with other sources to give users insights, trends and maintenance alerts.

Arundo [Palo Alto, 2015]: Arundo Analytics is a real-time operational intelligence platform that collects, filters and
segments data from industrial equipment and IoT sensors to monitor system performance. By applying big
data, machine learning and predictive analytics techniques, it is able to predict likely failures and other typical
scenarios that require action. This information is presented via easy-to-use dashboards that allow users to monitor and
take action well in advance of a problem, resulting in increased efficiency, uptime and revenue.


Verticals->Finance (10/19)
Company Details Funding Investors

OSIsoft [San Leandro, 1980]: OSIsoft, founded in 1980, is a United States based developer of real-time data
infrastructure solutions. Its trademark software, the PI System, is capable of capturing, processing, analyzing, and storing
any form of real-time data. The PI System is used at scale in several industries such as oil & gas, materials, mining and power.
In 2015 OSIsoft received the IoT Innovations award from Connected World Magazine.
Funding: $140M. Investors: Kleiner Perkins, Technology Crossover Ventures.

Space Time Insight [San Mateo, 2008]: Space Time Insight's technology turns large quantities of disparate
information into visual displays that enable businesses to visualize and analyze their resources across location, time, and
node, and to respond rapidly to disruptions in service. The company provides real-time visual analytics software for big data
in a range of industries. Clients include Southern California Edison, Florida Power & Light, San Diego Gas & Electric
and California's grid operator, among others.
Funding: $42M. Investors: Opus Capital, Start Up Farms International, Zouk Capital, Novus Energy Partners,
EnerTech Capital, NEC Corporation, Informatica, E.on.

BitStew [Burnaby, 2005]: Bit Stew Systems removes the complexity of industrial operations and connected machines
to give clarity and control back to the operator. Purpose-built for the Industrial Internet, Bit Stew's Mix Core
platform automates data ingestion and applies machine intelligence to learn patterns in the data, allowing industrial
companies to discover actionable insights that optimize operational performance.
Funding: $22.5M. Investors: Yaletown, Silicon Valley Bank, Cisco Investments, Kensington Capital, BDC, GE Ventures.

ParStream [Cologne, 2008]: ParStream provides a real-time big data analytics platform and databases targeted
towards IoT applications. Its database (ParStream DB) is a distributed, massively parallel processing columnar
database based on a shared-nothing architecture. It provides sub-second response times on billions of data records
while continuously importing new data. ParStream's Analytics Platform queries at the source of data for real-time
analysis as data is being loaded. It also provides unified analytics of real-time data in every query and generates
more accurate insights for decision-makers with the continuous import of new data. It includes tools such as
Geo-Distributed Analytics, Alerts & Actions, Time Series, Advanced Analytics, interfaces for the leading
streaming/ETL technologies, and integration with the leading visualization tools for IoT. It was named a 'Cool
Vendor' in Advancing Data Management Maturity by Gartner in 2012 and was listed in the CRN Big Data 100 in 2014.
Funding: $13.6M. Investors: CrunchFund, Khosla Ventures, Baker Capital, Tola Capital, Data Collective.


Verticals->Finance (11/19)
Company Details Funding Investors

TempoIQ [Chicago, 2014]: TempoIQ is an end-to-end platform to connect devices to the cloud, store data, analyze the
data and view the results on its dashboard. TempoIQ Composer enables users to build real-time, cloud-based
dashboards and applications without coding. It is used in industrial automation, medical devices, oil & gas,
resource management, smart grid and similar areas. The company has pivoted from its initial offering of a database
service.
Funding: $4.068M. Investors: Divergent, Chicago Ventures, Hyde Park Venture Partners, Techstars, Data Collective,
Hyde Park Angels.

GoFactory [San Francisco, 2011]: GoFactory is a San Francisco based Industrial IoT company that provides a cloud
service to connect machines, sensors, systems and people to drive intelligent action and response. GoFactory connects
the workforce to assets by filtering real-time data, identifying faults in processes and directing the right person
to each fault. GoFactory was named to Gartner's Cool Vendors in Mobile Security and IoT Security list for 2015.
Funding: $2M. Investors: Visionnaire Ventures.

Barrage [Madison, 2015]: Barrage, a product of MIOsoft, provides a real-time analytics solution for IoT. Sample data
can be uploaded, or a live stream can be connected using the SDK. APIs connect Barrage to applications, and a
graphical console provides feedback. Alongside real-time monitoring, it offers historical analysis to check whether an
event has occurred in the past. When an event occurs, pattern matching is used to create alerts in real time. The
platform is available for on-premise and OEM applications.

Introspective Systems [Portland, 2010]: Introspective Systems provides xGraph technology for managing data and
analytics for an oil and gas exploration company, seismic networks, medical diagnostics and smart cities. xGraph
provides an architecture based on the concept of an autonomic computing system, much like a human nervous system.
It offers a framework in which processing takes place in the graph data structure by broadcasting data across
distributed analytics. Developers use it to implement the structure of an IoT or distributed analytics application,
with a streamlined method for handling communication, interfaces and data stores.


Verticals->Finance (12/19)
Company Details Funding Investors
Trax Technologies [Scottsdale, 1993]: Trax Technologies works with logistics transaction data, which is often
incomplete, inaccurate or inconsistent for buyers and sellers of logistics services. It not only identifies bad
invoices but also fixes bad data at its root, through technology and services for both buyers and sellers. Trax offers
a cloud platform intended to make logistics data trustworthy, and provides companies with insights through predictive
analytics. It applies big data technologies to its repository of over 1 billion logistics transactions from all
industries, modes and countries to help businesses tune their operations.
Funding: $20M. Investors: Capital Southwest, River Cities Capital Funds.

enVista [Carmel, 2002]: enVista is an enterprise cost management services provider, offering expert consulting and
technology services from source to consumption. It enables leading companies to reduce operating costs, improve
customer service and enhance profitability through innovative solutions and deep domain expertise across a
multitude of industries and vertical markets.
Funding: $13.6M. Investors: Borealis, Point Judith Capital, Egan-Managed Capital.

LogiNext [Mumbai, 2013]: LogiNext helps logistics companies improve their internal operations, optimize delivery
networks and provide superior customer service using data collection, advanced analytics and visualization. It helps
supply chain and logistics companies track their shipments, delivery staff, vehicles and other assets on a map
interface. Its first product, Track-A-Pack, is a location-based analytics solution with a GPS-enabled tracker that can
be attached to any shipment and mapped to the invoice number using the free iOS or Android app. The tracker collects
real-time data and helps customers identify the slowest hubs, longest delivery times and best routes between any two
locations. The smart tag can be rented, with the customer paying only for the days on which it is in use. Also offers
analytics such as identifying trends in typical delays and blockages that occur in deliveries. Carnegie Mellon
founding team. One of 10 startups in the first batch of GenNext Innovation Hub. Raised around $500k in seed funding
from Indian Angel Network, which valued the company in the range of $2-3 million. Raised $10M Series A funding from
Paytm in Sep '15. Acquired last-mile delivery company YourGuy in April 2016.
Funding: $10.5M. Investors: Indian Angel Network, Paytm.

Chainalytics [Atlanta, 2001]: Chainalytics provides supply chain consulting, analytics, and market intelligence. Its
specialties include Supply Chain Design, Sales & Operations Planning (S&OP), Logistics Operations, Transportation,
Service Supply Chain, and Packaging Optimization. Provides robust analysis of situations to enable clients to
effectively make decisions based on data, consideration of all available options, and prediction of outcomes.
Funding: $10M. Investors: Global Environment Fund.


Verticals->Finance (13/19)
Company Details Funding Investors
CargoSense [Reston, 2012]: CargoSense is a big-data and analytics SaaS company that has developed supply chain
software solutions to optimize logistics networks in healthcare, food, medical devices, and other industries. It has
developed iOS-based apps that use Bluetooth sensors to collect data. It also offers a logistics data management
application for the iPad that provides shipping container climate data management from the shipper to the consignee.
Funding: $6.1M. Investors: New Dominion Angels, Middleburg Capital Development, Ltd., Center for Innovative
Technology, IrishAngels.

Weft [Burlington, 2013]: The Weft platform integrates with Weft sensors, sensors from other providers, and ERP and
CRM systems. Users log in to see current status and to set alerts for out-of-bounds conditions (location,
temperature, moisture, shock/vibration) and stages completed. Through predictive analytics, Weft takes the current
location and planned route of all shipments on the platform and combines this information with historical data to
identify likely bottlenecks and alert shippers to potential problems before they occur. A small piece of hardware
attached to cargo in transit provides real-time monitoring and analytics, allowing companies to check on their
shipments and see whether there is a better way to get from point A to point B. Weft uses GPS technology to find
patterns and problems and to solve them. Participated at SAP TechEd in Las Vegas on October 21, 2013. Working with
Vodafone, Telefonica, and Etisalat on different initiatives.
Funding: $3M. Investors: Andreessen Horowitz, Data Elite.
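
The out-of-bounds alerting described in the Weft entry can be illustrated with a minimal sketch. Everything in it is
hypothetical: the `Bounds` type, the field names and the threshold values are assumptions made for illustration and
are not Weft's actual API or data model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Bounds:
    """Allowed range for one monitored condition (hypothetical type)."""
    low: float
    high: float

    def contains(self, value: float) -> bool:
        return self.low <= value <= self.high


def out_of_bounds(reading: dict, limits: dict) -> list:
    """Return the sorted names of conditions in `reading` that fall outside `limits`.

    Conditions that have no configured limit are ignored.
    """
    return sorted(
        name
        for name, value in reading.items()
        if name in limits and not limits[name].contains(value)
    )


# Assumed limits for a temperature-sensitive shipment; the values are illustrative only.
LIMITS = {
    "temperature_c": Bounds(2.0, 8.0),
    "moisture_pct": Bounds(0.0, 60.0),
    "shock_g": Bounds(0.0, 3.0),
}

reading = {"temperature_c": 9.5, "moisture_pct": 45.0, "shock_g": 4.2}
print(out_of_bounds(reading, LIMITS))  # ['shock_g', 'temperature_c']
```

A production system would evaluate such rules per shipment as sensor readings arrive; the sketch only shows the
threshold comparison itself.
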
Pivot [Austin, 2013]: Pivot Freight provides retailers and manufacturers visibility into their inbound supply chain
operations through cloud-based SaaS software. Its shipping platform enables real-time visibility into in-transit
inventory (delivery exceptions, vendor and carrier scorecards, predictive analytics), directly connects carriers and
vendors, optimizes carrier choice and fully executes the shipment. For shipments not executed on the platform, its
proprietary Data Ingestion Engine continuously collects in-transit shipment data from integrated sources and provides
robust analytics and the ability to observe, measure and act in real time.
Funding: $2.8M. Investors: Silverton Partners, Capital Factory.

Traxens [Marseille, 2012]: Traxens uses breakthrough technology for cost-effective data capture from multimodal
containers, coupled with Big Data techniques, to deliver actionable information to the right people at the right time.
This information allows stakeholders in multimodal transport to improve costs, optimize investment, and offer
premium services. Traxens works with partners to establish standards to allow interoperability across the whole
shipping industry.
Funding: $1.6M. Investors: Credit Agricole S A, CMA CGM, TERTIUM.


Verticals->Finance (14/19)
Company Details Funding Investors
Shipsy [Gurgaon, 2015]: Shipsy operated as a C2C logistics company, delivering parcels locally with its own fleet of
riders and partnering with other logistics providers for inter-city services. It pivoted its business model in
April '16 to offer an analytics platform for enterprises in the logistics sector. Shipsy now provides architecture
design for analytics, systems for business processes and an analytics platform for insights. In Sep '15, it raised an
undisclosed angel investment from a group of angel investors led by Dheeraj Jain, partner at Redcliffe Capital, a
UK-based hedge fund, with participation from Nittin Passi, Ankit Jhunjhunwala, Vishal Chandra and Udaan Angel
Partners. In April '16, it raised a $1M round from DTDC, which picked up a 20% stake in the company.
Funding: $1M.
Transmetrics [Sofia, 2013]: Transmetrics helps freight companies predict the volume of shipments for the next
2-6 weeks, which enables them to eliminate empty capacity before it occurs. This leads to multi-million savings
and, in addition, reduces CO2 emissions, lowers the amount of fossil fuel used by the transport industry, reduces
traffic on the roads and lowers prices for freight shipments. Transmetrics' prediction approach is based on advanced
statistical methods, machine learning and data mining algorithms. Its pricing follows a monthly pay-as-you-go model,
with companies paying only for what they actually use in prediction.
Funding: $670k. Investors: Launchub.
Propel IT, Inc. [Pittsburgh, 2008]: Propel is a behavioral analytics company providing efficiency and productivity
tools for multiple sectors within the transportation and logistics industries. Its fuel efficiency system for trucking
fleets, FuelOpps, takes data from a truck's existing telematics system on a daily or weekly basis to measure driving
behaviors and lower fuel costs. It automates the process of understanding fuel efficiency and gives users ways to
increase it. Other products include SafetyOpps, a solution for risk and incident reduction; RetentionOpps, for
reducing the attrition rate in the transportation industry; and EmissionOpps, which assists clients with their
sustainability and carbon credit exchange needs.
Investors: Innovation Works.
Algorhythm Tech [Pune, 1999]: Algorhythm is a cloud computing products company engaged in the creation of
technology and business products for mass use. Its products are AppliFIRE, a browser-based Rapid Application
Development / DevOps platform that auto-generates high-quality code to help develop sophisticated enterprise-class
web or mobile applications by simple drag & drop; BeatZ, a last-mile optimizer for distributors of FMCG companies
that generates optimal sales beats, salesman routes etc. and helps free up salesmen's time (key customers include
Unilever, Reckitt Benckiser and Britannia); and RoutZ, a daily route planner (pay-per-use SaaS offering) that
optimizes vehicle utilization, distance traveled etc. and helps track delivery status.
Investors: Mumbai Angels.


Verticals->Finance (15/19)
Company Details Funding Investors
Newton Insight [Plano, 2011]: Newton Insight is a software as a service (SaaS) supply chain management (SCM)
company that helps businesses manage and improve the transportation and logistics of their time-sensitive and
immediate deliveries. Its SaaS solution links the systems of a business's multiple delivery service vendors into one
platform, allowing the business to coordinate, track, and ensure that couriers make deliveries to the right place, at
the right time and in the right condition. It automates transportation and logistics processes, which lowers costs
and improves efficiency. In addition, Newton Insight gives businesses performance metrics to help them hold their
couriers/logistics vendors accountable. Through various mathematical models and algorithms, it aggregates turnaround
times (TAT), costs, and complaints to arrive at a standard performance metric specific to each business. Lastly,
Newton Insight offers analytic modules to help businesses understand their operations. They can run reports across
multiple indices to uncover bottlenecks in their processes, such as the time it takes to dispatch an order or
complete a delivery, or the costs or late deliveries for a certain location. Businesses can then address these
visible and often hidden problems to make better decisions and improve their operations and service.
Investors: Tech Wildcatters.
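
The idea of aggregating turnaround times, costs and complaints into a single vendor metric can be sketched as a
weighted score. This is a hypothetical illustration only: the function name, the normalisation against targets and
the default weights are assumptions, not Newton Insight's actual models or algorithms.

```python
def vendor_score(avg_tat_hours, avg_cost, complaints, deliveries, *,
                 target_tat_hours, target_cost, weights=(0.5, 0.3, 0.2)):
    """Combine TAT, cost and complaint rate into a 0-100 score (hypothetical formula).

    Each component is scaled to [0, 1], where 1 means the target is met or beaten.
    `weights` are the business-specific weights for (TAT, cost, complaints).
    """
    if deliveries <= 0 or avg_tat_hours <= 0 or avg_cost <= 0:
        raise ValueError("deliveries, avg_tat_hours and avg_cost must be positive")

    tat_component = min(target_tat_hours / avg_tat_hours, 1.0)
    cost_component = min(target_cost / avg_cost, 1.0)
    complaint_component = 1.0 - min(complaints / deliveries, 1.0)

    w_tat, w_cost, w_complaints = weights
    weighted = (w_tat * tat_component
                + w_cost * cost_component
                + w_complaints * complaint_component)
    # Divide by the weight total so the weights need not sum to 1.
    return round(100.0 * weighted / (w_tat + w_cost + w_complaints), 1)


# A courier that is slower and costlier than target, with 5 complaints in 100 deliveries.
score = vendor_score(30.0, 12.0, 5, 100, target_tat_hours=24.0, target_cost=10.0)
print(score)  # 84.0
```

Because the weights are a parameter, each business could tune the relative importance of speed, cost and complaints,
which mirrors the "specific to each business" aspect of the description.
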
Cubic [San Diego, 1951]: Cubic Corporation operates two subsidiaries: Cubic Transportation Systems (CTS) and Cubic
Global Defense (CGD). CTS provides transport agencies with payment, data and traffic management solutions. It
enables 24 billion transportation payment transactions each year for 450 transport partners and serves over 38
million people every day. CGD designs and develops training systems and solutions, range designs, communication
products, tactical training programs and mission rehearsal exercises, and provides operational and technical support
to government agencies. Has a presence in nearly 60 countries.
Citi Logik [London, 2011]: Citi Logik has developed a big data platform to address the planning and operational needs
of smarter cities. The platform captures and displays anonymised mobile phone data in near real time for analysis by
experts in transport and town planning. Citi Logik comprises three divisions: a data processing division; an
enterprise service division, which manages complex assignments that require the deployment of professional services,
predictive analytics and anonymised network data; and a business solution division, which delivers replicable data
offerings and services to the Intelligent Mobility, Smarter Cities and Built Environment markets. It has partnerships
with Esri UK, Tracsis, and AWS.


Verticals->Finance (16/19)
Company Details Funding Investors

ntrepo [Cape Town, 2015]: ntrepo provides management tools for freight forwarders, importers and exporters for
better management of their freight expenditure. For shippers, it offers rate management, tender automation, instant
quotes and business intelligence. It also offers routing/pricing optimization and price benchmarking for the latter.

xRoute [Dallas, 2015]: xRoute provides a mobile app for truckers and fleet owners for better freight analytics. The
app provides information on trip costs of alternate routes, speed limit changes, rest stops and fuel recommendations,
as well as post-trip analytics. It also gives truckers tools for messaging. The company plans to work with government
to understand freight corridor performance and truck bottlenecks.

Next Generation [Illinois City, 1988]: Next Generation Logistics is a provider of supply chain services and technology
solutions. The company has 3 distinct divisions focusing on outsourced managed transportation services,
transportation management software (TMS), and supply chain network optimization studies. Offers FreightMaster
TMS and Dynamics TMS: enterprise transportation management planning and execution solutions that enable
shippers to rapidly manage their own inbound, outbound, and transfer freight. The solutions can be deployed On-
Premise, Hosted On-Premise or as a Hosted SaaS solution with scalable, global capabilities to fit any size organization.

Cheetah Software Systems [Westlake Village, 1987]: Cheetah Software Systems provides a software platform for
logistics operations optimization. Its systems can run in state-of-the-art data centers on popular hardware platforms
such as IBM, Cisco and F5. It provides several modules for optimizing logistics operations, including Logistics
Framework, Routing, Route Editor, Tracks, Reporting, Warehouse and Asset Management.

Fortigo [Austin, 2000]: Fortigo provides SaaS solutions for automating, optimizing, and auditing logistics processes.
Integrates with existing ERP or WMS systems to provide return on investment by optimizing logistics processes,
minimizing ship-to-order times and streamlining collaboration with logistics providers. Process automation eliminates
duplicate data entry and streamlines resource utilization thereby reducing the overall cost.


Verticals->Finance (17/19)
Company Details Funding Investors

Grand Canal Solutions [San Jose, 2014]: Optimize, developed by Grand Canal, is a SaaS platform that consolidates
historical supply chain and shipping data to analyze and visualize trends, and automatically recommends ways to
improve. It uses predictive analytics to understand the future impact of investments and business decisions and to
identify inefficiencies in the supply chain. Customers include Harmonic, The BRIX Group and Krave Jerky.

Loadtap [Mountain View, 2015]: Loadtap is a SaaS-based dispatching and fleet management platform. Its primary
clients are small trucking companies. It provides tools for load management and load dispatching along with GPS-based
mobile tracking. It also supports online documentation and the paperwork attached to any transportation request.
Companies and their drivers can check load status, capture documents electronically (BOLs, PODs, receipts and other
documents) and view their upcoming schedule along with their work history.

Tilikin [San Francisco, 2015]: Tilikin is an enterprise SaaS predictive analytics platform that helps large
container-shipping companies understand how to optimally allocate and reposition their shipping containers around the
world.

Coretex [Auckland, 2015]: Coretex develops and supplies fleet management solutions to transport operators. Formed
from the merger of International Telematics and Imarda, Coretex’s solutions include the ibright® system and the i360®
Action Engine. ibright® system is a single platform telematics solution which provides intelligence about fleet
movements and individual assets. ibright is utilised by customers operating fleets of refrigerated assets, trucks,
tractors, dry vans and rail assets. i360 Action Engine is designed for ease of use for small- to mid-range fleets, while
also providing sophisticated bespoke recording and reporting tools for enterprise level fleets. Apart from the ibright®
system and the i360® Action Engine, the company also provides eRUC software, which helps transport organisations meet
compliance requirements.


Verticals->Finance (18/19)
Company Details Funding Investors

Leanframe [Montreal, 2015]: Leanframe offers an API-driven fleet production schedule optimizer for the transportation
industry. It offers a platform for route planning and optimization, fleet scheduling, etc. The application takes into
account multiple variables to generate the optimal route and scheduling options. In beta as of Mar '16.

Axestrack [Jaipur, 2014]: Axestrack is a technology start-up working on GPS-based vehicle tracking solutions and the
associated data analytics. Its data analytics solution is used for consignment tracking, and the company claims to
have helped various clients improve their supply chain management. It claims to track 33,000+ vehicles, with
quarterly growth of 40% and a network of 150+ partners.

Acuitive Solutions [Charlotte, 2002]: Acuitive Solutions is a privately held, cloud-based supply chain software
company headquartered in Charlotte, NC. It has developed a suite of multi-modal TMS portals that aids decision-making,
improves data accuracy, creates accountability and visibility, and automates the entire process from planning to
auditing. Handles international point-to-point rating and chargeable weight calculations based on volumetric (DIM)
conversions. Also provides pre-payment audit services that can be integrated into the existing platform of a
third-party provider.
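
A chargeable weight calculation based on a volumetric (DIM) conversion is commonly defined as the greater of the
actual weight and the volumetric weight. The sketch below shows that general convention; it is not Acuitive's
implementation. The function names are hypothetical, and the default divisor of 6000 cm³/kg is an assumption (a
widely used air-freight convention), since divisors vary by carrier and mode.

```python
def volumetric_weight_kg(length_cm, width_cm, height_cm, divisor=6000.0):
    """Volumetric (DIM) weight in kg: parcel volume in cm^3 divided by a DIM divisor.

    The default divisor (6000 cm^3 per kg) is an assumed convention, not a universal rule.
    """
    if min(length_cm, width_cm, height_cm) <= 0 or divisor <= 0:
        raise ValueError("dimensions and divisor must be positive")
    return length_cm * width_cm * height_cm / divisor


def chargeable_weight_kg(actual_kg, length_cm, width_cm, height_cm, divisor=6000.0):
    """Chargeable weight: the larger of actual weight and volumetric weight."""
    return max(actual_kg, volumetric_weight_kg(length_cm, width_cm, height_cm, divisor))


# A light but bulky parcel: 60 x 50 x 40 cm weighing 12 kg is billed on volume.
print(chargeable_weight_kg(12.0, 60, 50, 40))  # 20.0
# A dense parcel of the same size weighing 25 kg is billed on actual weight.
print(chargeable_weight_kg(25.0, 60, 50, 40))  # 25.0
```

Passing a different `divisor` (for example 5000, which some express carriers use) changes the volumetric weight
without changing the rest of the calculation.
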

Agistix [San Mateo, 2004]: Agistix offers a data and analytics tool for global supply chain execution, visibility &
event management. It is designed to support all shipments and all modes of transportation (domestic, international,
inbound, outbound, and third party), enabling clients to monitor, manage, and execute global supply chain activities
in real time. It also provides professional services, including implementation, custom development, outsourced call
center management training, and best practice consulting.

Memex [Glasgow, ]: Memex provides data analytics and enterprise search technology for law enforcement agencies.
It also provides information processing and intelligence management offerings for public security. Helps law
enforcement agencies enhance public safety and prevent and deter crime, terrorism and other threats. Customers include
the British Transport Police; Delaware, Michigan, New Hampshire, and Pennsylvania state police; Georgia Bureau of
Investigation; Kansas City Terrorism Early Warning Group; Northeast Ohio Regional Fusion Center; Los Angeles and
Philadelphia police departments; Central California Intelligence Center; Belize Police Department; Albania State
Police; and the United Nations Office on Drugs and Crime.

Verticals->Finance (19/19)
Company Details Funding Investors

Yidu Cloud [Beijing, 2014]: Yidu Cloud provides big data analytics to healthcare service providers for generating
insights from their medical data. It uses machine learning for medical data processing and deep mining, and provides
medical record research and diagnosis assistance for doctors, hospitals and research institutions. Claims to have
tied up with 400 hospitals across China.

FusionOps [Sunnyvale, 2005]: FusionOps provides a SaaS supply chain business intelligence big data tool, offering
1000+ pre-built supply chain reports, dashboards and analytics to visualize inventory, order fulfillment and
purchasing metrics in real time. It targets all product manufacturing companies and has installations with global
manufacturing companies in the automotive, apparel, electronics, healthcare and consumer packaging industries.
Customers include major brands such as Merck, Columbia Sportswear, Brocade, Mahindra, and Orica, among others. The
company has also announced its new FusionOps for Salesforce.com application and support for Oracle E-Business Suite.
Funding: $44.6M. Investors: Georgian Partners, New Enterprise Associates, Sierra Ventures.


Services
Company Details Funding Investors
Gauge data solutions [Noida, 2013]: Gaugeanalytics is a Big Data consulting firm offering cutting edge knowledge
discovery & predictive analytics solutions for a variety of verticals including Healthcare, Telecom, BFSI, Retail,
Sports and Law. Its data ingestion solutions make previously recorded data existing in silos and legacy systems
accessible to contemporary analytics platforms. It identifies new sources of data and creates the necessary
infrastructure to collect it based on the organization's needs, domain of operation, and expectations.

DataFactZ [Northville, 2005]: DataFactZ is a Big Data services and technology company. It provides data science
platforms that use statistics, machine learning, deep learning, decision science, cognitive science, and business
intelligence capabilities to develop purpose-built applications such as customer, supply chain, marketing and sales
analytics. It uses analytical techniques including Descriptive Analytics, Diagnostic Analytics, Predictive Analytics,
and Prescriptive Analytics. It employs a number of technologies in the area of Big Data and Advanced Analytics, such
as DataStax (Cassandra), Databricks (Spark), Cloudera, Hortonworks, MapR, R, SAS, Matlab, SPSS and Advanced Data
Visualizations.

Clarity Solution Group [Chicago, 2004]: Clarity Solution Group provides consulting for big data analytics, business
intelligence, master data management and data governance solutions. It is an onshore US consultancy focused
exclusively on data and analytics. It offers full lifecycle solutions for brands across multiple sectors including
financial services, insurance, healthcare, and high technology.


Big Data Analytic Suites (1/2)


Company Details Funding Investors

QlikTech [Radnor, 1993]: Qlik provides self-service data visualization and analytics solutions. It has two product
offerings, QlikView and Qlik Sense. The QlikView Business Discovery software platform enables organizations to
optimize data as a strategic resource. Qlik Sense offers data visualization tools with multiple data access points and
mobile applications. Qlik serves approximately 33,000 customers worldwide in over 100 countries.
Funding: $12.5M. Investors: Accel Partners, Jerusalem Venture Partners.

Ingensi [Paris, 2011]: Ingensi provides a Big Data Analytics Platform based on Hadoop. It develops and supports three
products: GridCity, Analytic Suite and Cloud Keeper. Analytic Suite integrates data from multiple data sources into a
single repository. The suite consists of tools for processing, cleaning and indexing all types of documents,
databases and business logs, and also provides visualization tools, predictive analytics and search engines. GridCity
is a big data application that helps visualize and monitor network infrastructure in real time. Areas where GridCity
is used include electricity, gas, optical fibre, telecoms and logistics.
Apervi [Irving, 2012]: Apervi is a big data integration platform for analytics applications involving streaming and
batch data. Users can build big data pipelines from sources such as Hadoop and Spark using ready-to-use connectors and
operators in a drag-and-drop GUI, without having to write code. It can be used for data at rest (batch processing)
and real-time data (stream processing). Data workflows can be reused when moving from experimentation to
deployment. Apervi Conflux reduces the need to write custom code using technologies like MapReduce, Pig, Scala and
Hive by packaging those features in the platform. Clients include AT&T, Cloudera, Verizon and Databricks.

DataSwarm [Madrid, 2015]: DataSwarm is an open source project that offers a distribution of the web-based Apache
Zeppelin notebook with additional interpreters and cluster management, among other features, giving data scientists a
tool for big data analysis for their business. Apache Zeppelin is a multi-purpose notebook for data ingestion, data
discovery, data analytics, and data visualization & collaboration. Zeppelin is undergoing incubation at the ASF. It
provides built-in Apache Spark integration and supports multiple language backends.

Fusionex [Kuala Lumpur, 2006]: Fusionex International PLC is an international provider of enterprise software
solutions and related services, primarily focused on the high-growth Asia Pacific region. The company's software
solutions are focused on two sectors: core transactional systems and business intelligence. It provides a dashboard
analytics tool and data integration services for both structured and unstructured data. Other products include
mobility, loyalty and cloud software. Fusionex works with many Fortune 500 clients within Silicon Valley in
the United States, Europe as well as the Asia Pacific region.


Big Data Analytic Suites (2/2)


Company Details Funding Investors

Lavastorm [ , 1999]: Lavastorm's Advanced Business Analytics software helps business analysts accelerate time
to insight and prepare data for better visualizations. It has automated data integration tools to clean and catalog
the data before visualization.

MicroStrategy [Vienna, 1989]: MicroStrategy offers a self-service analytics solution focused on multi-source
analysis, combining and blending data from multiple structured and unstructured sources. A simple drag-and-drop
UI, intuitive visualizations and quick connections to any data source are combined with one-click sharing
of any insight. Built-in discovery and visualization tools help in discovering related datasets without any
pre-defined data model. Also supports Map Analytics and Map Visualization.

augmentIQ [Pune, 2012]: augmentIQ is a big data technology and analytics company focusing on products for
consumer intelligence. Its flagship product is the MAXIQ platform, which enterprises can use to process data
and information from all sources for fast analysis and business insights, including solutions for a 360-degree
view of the customer, risk analytics, and enterprise search & text mining.



Team

Manish Jaiswal: Senior Analyst, Big Data Analytics
Harini Kancharana: Lead Analyst, Enterprise Infrastructure
