This article lists widely used tools in the big data field.
COMPANY |
PRODUCT |
DESCRIPTION |
FREE TRIAL |
WEBSITE |
1010data |
Insights Platform |
Data management, analysis, modeling, reporting, visualization, and RAD apps |
Available by request |
1010data.com/products/insights-platform/analysis-modeling |
Actian |
Vector |
DBMS, column store, analytics platform |
30 days |
actian.com/analytic-database/vector-smp-analytic-database |
Aginity |
Aginity Amp |
Data analytics management platform |
Demo available by request |
aginity.com/amp-overview |
Alation |
Alation |
Enterprise data collaboration and analytics platform |
Demo available by request |
alation.com/product |
Alluxio Open Foundation |
Alluxio |
Distributed storage system across all store types |
Open source |
alluxio.org |
Alpine Data |
Alpine Chorus 6 |
Data science, ETL, predictive analytics, and execution workflow design and management |
Demo available by request |
alpinedata.com/product/ |
Alteryx |
Alteryx Analytics Platform |
ETL, predictive analytics, spatial analytics, automated workflows, reporting, and visualization |
Available by request |
alteryx.com/products/alteryx-designer |
Amazon Web Services |
Amazon Kinesis |
Stream data ingestion, storage, query, and analytics PaaS |
N/A |
aws.amazon.com/kinesis |
Amazon Web Services |
Amazon Machine Learning |
Machine learning algorithms-as-a-service, ETL, data visualization, modeling and management APIs, batch and realtime predictive analytics |
N/A |
aws.amazon.com/machine-learning |
Apache Foundation |
Ambari |
Hadoop cluster provisioning, management, and monitoring |
Open source |
ambari.apache.org |
Apache Foundation |
Apex |
Stream and batch processing on YARN |
Open source |
apex.apache.org |
Apache Foundation |
Avro |
Data serialization system (data structure, binary format, container, RPC) |
Open source |
avro.apache.org |
Apache Foundation |
Beam |
Programming model for batch and streaming data processing |
Open source |
beam.apache.org |
Apache Foundation |
Crunch |
Java library for writing, testing, running MapReduce pipelines |
Open source |
crunch.apache.org |
Apache Foundation |
Drill |
Distributed queries on multiple data stores and formats |
Open source |
|
Apache Foundation |
Falcon |
Data governance engine for Hadoop clusters |
Open source |
|
Apache Foundation |
Flink |
Streaming dataflow engine for Java |
Open source |
|
Apache Foundation |
Flume |
Streaming data ingestion for Hadoop |
Open source |
|
Apache Foundation |
Giraph |
Iterative distributed graph processing framework |
Open source |
|
Apache Foundation |
GraphX |
Graph and collection processing on Spark |
Open source |
|
Apache Foundation |
GridMix |
Benchmark for Hadoop clusters |
Open source |
|
Apache Foundation |
Hadoop |
MapReduce implementation |
Open source |
|
Apache Foundation |
Hama |
Bulk synchronous parallel (BSP) implementation for big data analytics |
Open source |
|
Apache Foundation |
HAWQ |
Massively parallel SQL on Hadoop |
Open source |
|
Apache Foundation |
HDFS |
Distributed file system (Java-based, used by Hadoop) |
Open source |
|
Apache Foundation |
Hive |
Data warehousing framework on YARN |
Open source |
|
Apache Foundation |
Ignite |
In-memory data fabric |
Open source |
|
Apache Foundation |
Impala |
Distributed SQL on YARN |
Open source |
impala.apache.org |
Apache Foundation |
Kafka |
Distributed pub-sub messaging |
Open source |
|
Apache Foundation |
MADlib |
Big data machine learning in SQL |
Open source |
|
Apache Foundation |
Mahout |
Machine learning and data mining on Hadoop |
Open source |
|
Apache Foundation |
Mesos |
Distributed systems kernel (all compute resources abstracted) |
Open source |
|
Apache Foundation |
Oozie |
Workflow scheduler (DAGs) for Hadoop |
Open source |
|
Apache Foundation |
ORC |
Columnar storage format |
Open source |
|
Apache Foundation |
Parquet |
Columnar storage format |
Open source |
|
Apache Foundation |
Phoenix |
OLTP and operational analytics for Apache Hadoop |
Open source |
|
Apache Foundation |
Pig |
Turns high-level data analysis language into MapReduce programs |
Open source |
|
Apache Foundation |
Samza |
Distributed stream processing framework |
Open source |
|
Apache Foundation |
Spark |
General-purpose cluster computing framework |
Open source |
|
Apache Foundation |
Spark Streaming |
Discretized stream processing with Spark's RDDs |
Open source |
|
Apache Foundation |
Sqoop |
Bulk data transfer between Hadoop and structured datastores |
Open source |
|
Apache Foundation |
Storm |
Distributed realtime (streaming) computing framework |
Open source |
|
Apache Foundation |
Tez |
Dataflow (DAG) framework on YARN |
Open source |
|
Apache Foundation |
Thrift |
Data serialization framework (full-stack) |
Open source |
|
Apache Foundation |
YARN |
Resource manager (distinguishes global and per-app resource management) |
Open source |
hadoop.apache.org/docs/r2.7.1/hadoop-yarn/hadoop-yarn-site/YARN.html |
Apache Foundation |
Zeppelin |
Interactive data visualization |
Open source |
|
Apache Foundation |
ZooKeeper |
Coordination and state management |
Open source |
|
Attunity |
Attunity Visibility |
Data warehouse and Hadoop data usage analytics |
Demo available by request |
|
Attunity |
Attunity Replicate |
Data replication, ingestion, and streaming platform |
Available by request |
|
BigML |
BigML |
Predictive analytics server and development platform |
N/A |
|
Bitam |
Artus |
Business intelligence platform |
Available by request |
|
Board |
BOARD All in One |
BI, analytics, and corporate performance management platform |
Demo available by request |
|
CAPSENTA |
Ultrawrap |
Database wrapper for lightweight data integration |
Available by request |
|
Cask Data |
Cask |
Containers (i.e. data, programming, application) on Hadoop for data lakes |
Demo available by request |
|
Cask Data |
Cask Data App Platform |
Analytics platform for YARN with containers on Hadoop, visual data pipelining, and data lake metadata management |
Free tier available |
|
Cazena |
Cazena |
Cloud-based data science platform |
Demo available by request |
|
|
|
Simple JavaScript charting library |
Open source |
|
Cirro |
Cirro Data Cloud |
Database management system for cloud databases |
Demo available |
cirro.com/#/#product |
Cisco |
Cisco Edge Fog Fabric |
IoT and streaming data analytics |
N/A |
cisco.com/c/en/us/products/analytics-automation-software/edge-analytics-fabric |
Cloudera |
Cloudera Enterprise Data Hub |
Predictive analytics, analytic database, and Hadoop distribution |
Demo available by request |
|
Confluent |
Confluent Platform |
Data integration, streaming data platform |
Free tier available |
|
|
|
Declarative-flavored JavaScript visualization library |
Open source |
|
Databricks |
Databricks |
Data science (ingestion, processing, collaboration, exploration, and visualization) on Spark |
14 days |
|
Dataguise |
Dataguise DgSecure |
Big data security monitoring |
Available by request |
|
Dataiku |
Dataiku DSS |
Collaborative data science platform |
14 days |
|
Datameer |
Datameer |
BI, data integration, ETL, and data visualization on Hadoop |
Available by request |
|
DataRobot |
DataRobot |
Machine learning model-building platform |
Demo available by request |
|
DataRPM |
DataRPM |
Cognitive predictive maintenance for industrial IoT |
Available by request |
|
DataTorrent |
DataTorrent RTS |
Stream and batch (based on Apache Apex) application development platform |
Free tier available |
|
DataWatch |
DataWatch Monarch |
Data extraction and wrangling, self-service analytics, streaming visualization |
30 days |
|
Disco Project |
Disco |
MapReduce framework for Python |
Open source |
|
Domo |
Domo |
Data integration, preparation, and visualization |
Available by request |
|
Druid |
Druid |
Columnar distributed data store with realtime queries |
Open source |
|
Eclipse Foundation |
BIRT |
Visualization and reporting library for Java |
Open source |
|
|
EngineRoom |
Geospatial, data transformation and discovery, modeling, predictive analytics, and visualization |
N/A |
|
EnThought |
SciPy |
Scientific computing ecosystem (multi-dimensional arrays, interactive console, plotting, symbolic math, data analysis) for Python |
Open source |
|
Exaptive |
Exaptive |
RAD and application marketplace for data science |
Free tier available |
|
Exasol |
Exasol |
In-memory analytics database |
Free tier available |
|
|
Presto |
Distributed interactive SQL on HDFS |
Open source |
|
Fair Isaac Corporation |
FICO Decision Management Suite |
Data integration, analytics, and decision management |
N/A |
|
GFS2 Group |
GFS |
(Global File System) Shared-disk file system for Linux clusters |
Open source |
|
GoodData |
GoodData Platform |
Data distribution, visualization, analytics (R, MAQL), BI, and warehousing |
N/A |
|
|
Protocol Buffers |
Data serialization format and compiler |
Open source |
|
|
TensorFlow |
An open-source software library for machine intelligence |
Open source |
|
Graphviz |
Graphviz |
Graph visualization toolkit |
Open source |
|
|
|
Stats, machine learning, and math runtime for big data |
Free tier available |
|
|
H2O |
Open-source prediction engine on Hadoop and Spark |
Open source |
|
Hitachi Group |
Pentaho |
Data integration layer for big data analytics |
30 days |
|
Hortonworks |
Hortonworks Data Platform |
Hadoop distribution based on YARN |
N/A |
|
Hortonworks |
Hortonworks DataFlow |
Streaming data collection, curation, analytics, and delivery |
N/A |
|
IBM |
IBM BigInsights |
Scalable data processing and analytics on Hadoop and Spark |
Available by request |
ibm.com/analytics/us/en/technology/biginsights |
IBM |
IBM Streaming Analytics |
Streaming data application development and analytics platform |
Available by request |
|
IBM |
IBM InfoSphere Information Server |
Data integration, data quality, and data governance |
Available by request |
ibm.com/analytics/information-server |
Ignite |
Infobright DB |
Column-oriented store with semantic indexing and approximation engine for analytics |
N/A |
ignitetech.com/solutions/information-technology/infobrightdb |
Infor |
Birst |
Enterprise and embedded BI and analytics platform |
Available by request |
|
Informatica |
Enterprise Data Lake |
Collaborative, centralized data lake, data governance |
N/A |
informatica.com |
Informatica |
Big Data Management |
Data integration platform on Hadoop |
N/A |
informatica.com |
Informatica |
Relate 360 |
Big Data analytics, visualization, search, and BI |
N/A |
informatica.com |
Informatica |
Big Data Streaming |
Event processing and streaming data management for IoT |
N/A |
informatica.com |
Information Builders |
WebFOCUS |
BI and analytics |
Demo available by request |
|
Information Builders |
Omni-Gen |
Data management, quality, integration platform |
Available by request |
|
Intersystems |
IRIS |
Data management, interoperability, and analytics |
N/A |
intersystems.com/products/intersystems-iris/#technology |
Java-ML |
Java-ML |
Various machine learning algorithms for Java |
N/A |
java-ml.sourceforge.net |
Jinfonet |
JReport |
Visualization, embedded analytics for web apps |
Available by request |
jinfonet.com/product |
JUNG Framework |
JUNG Framework |
Graph framework for Java and data modeling, analyzing, and visualizing |
Open source |
jung.sourceforge.net |
Kognitio |
Kognitio Analytical Platform |
In-memory, MPP, SQL and NoSQL analytics on Hadoop |
Free tier available |
|
Lavastorm |
Lavastorm Server |
Data preparation, analytics application development platform |
Free tier available |
|
LexisNexis |
LexisNexis Customer Information Management |
Data management and migration |
N/A |
risk.lexisnexis.com/corporations-and-non-profits/customer-information-management |
LexisNexis |
HPCC Platform |
Data management, predictive analytics, and Big Data workflow |
Open source |
|
Liaison Technologies |
Liaison Alloy |
Data management and integration |
Demo available by request |
|
Lightbend |
Lightbend Reactive Platform |
JVM application development platform with Spark |
Free tier available |
|
|
Pinot |
Real-time OLAP distributed data store |
Open source |
|
LISA Lab |
Theano |
Python library for multi-dimensional array processing with GPU optimizations |
Open source |
|
Loggly |
Loggly |
Cloud log management and analytics |
30 days |
|
Logi Analytics |
Logi Analytics Platform |
Embedded BI and data discovery |
Demo available by request |
|
Looker |
Looker Business Intelligence |
Data analytics and business intelligence platform |
Demo available by request |
|
Looker |
Looker Embedded Analytics |
Embedded analytics, data exploration, and data delivery |
Demo available by request |
|
MapR |
MapR Event Streams |
Global publish-subscribe event streaming system |
Free tier available |
|
MapR |
MapR Analytics and Machine Learning Engines |
Real-time analytics and machine learning at scale |
Free tier available |
|
MapR |
MapR Converged Data Platform |
Big Data platform on enterprise-grade Hadoop distribution with integrated open-source tools (Spark, Hive, Impala, Solr, etc.), NoSQL (document and wide column) DBMS |
Free tier available |
|
Micro Focus |
ArcSight Data Platform |
Data collection and log management platform |
Available by request |
software.microfocus.com/en-us/products/siem-data-collection-log-management-platform/overview |
Micro Focus |
IDOL |
Machine learning, enterprise search, and analytics platform |
N/A |
|
Micro Focus |
Vertica |
Distributed analytics database and SQL analytics on Hadoop |
Free tier available |
|
Microsoft |
SSRS |
SQL Server reporting (server-side) |
Free tier available |
|
Microsoft |
Azure Machine Learning Studio |
Predictive analytics and machine learning development platform |
Free tier available |
azure.microsoft.com/en-us/services/machine-learning-studio |
Microsoft |
Power BI |
Business intelligence platform |
Free tier available |
|
MicroStrategy |
Advanced Analytics |
Predictive analytics, native analytical functions, data mining |
Free tier available |
microstrategy.com/us/products/capabilities/advanced-analytics |
New Relic |
New Relic Insights |
Real-time application performance analytics |
Demo available by request |
|
NumFocus |
Julia |
Dynamic programming language for scientific computing |
Open source |
julialang.org |
NumFocus |
Matplotlib |
Plotting library on top of NumPy (like parts of MATLAB) |
Open source |
|
NumFocus |
NumPy |
Mathematical computing library (i.e. multi-dimensional arrays, linear algebra, Fourier transforms) for Python |
Open source |
|
NumFocus |
Pandas |
Data analysis and modeling for Python |
Open source |
|
Objectivity |
ThingSpan |
Graph analytics platform with Spark and HDFS integration |
Free tier available |
|
OpenText |
OpenText Big Data Analytics |
Analytics and visualization with analytics server |
45 days |
opentext.com/what-we-do/products/analytics/opentext-big-data-analytics |
OpenTSDB Authors |
OpenTSDB |
Time-series database on Hadoop |
Open source |
|
Oracle |
Big Data Discovery |
Big Data analytics and visualization platform on Spark |
Demo available |
|
Oracle |
R Advanced Analytics for Hadoop |
R interface for manipulating data on Hadoop |
N/A |
oracle.com/technetwork/database/database-technologies/bdc/r-advanalytics-for-hadoop/overview |
Palantir |
Gotham |
Cluster data store, on-the-fly data integration, search, in-memory DBMS, ontology, and distributed key-value store |
N/A |
|
Palantir |
Foundry |
Data integration platform |
N/A |
|
Panoply |
Panoply |
Data management and analytics platform |
21 days |
|
Panorama Software |
Necto |
Business intelligence, visualization, and data management |
Available by request |
|
Paxata |
Paxata Adaptive Information Platform |
Data integration, preparation, exploration, visualization on Spark |
Demo available by request |
|
Pepperdata |
Pepperdata Cluster Analyzer |
Big data performance analytics |
Demo available by request |
|
Pivotal |
Pivotal Greenplum |
Open-source data warehouse and analytics |
Open source |
|
Pivotal |
Spring Cloud Data Flow |
Cloud platform for building streaming and batch data pipelines and analytics |
N/A |
|
Prognoz |
Prognoz Platform |
BI and analytics (OLAP, time series, predictive) |
Free tier available |
|
Progress Software |
DataDirect Connectors |
Data integration: many-source, multi-interface (ODBC, JDBC, ADO.NET, OData), multi-deployment |
Available by request |
|
Project Jupyter |
Jupyter |
Interactive data visualization and scientific computing on Spark and Hadoop |
Open source |
jupyter.org |
Pyramid Analytics |
BI Office |
Data discovery and analytics platform |
Free tier available |
|
Qlik |
Qlik Sense |
Data visualization, integration, and search |
Free tier available |
|
Qlik |
Qlik Analytics Platform |
Data visualization platform |
Free tier available |
|
Qlik |
QlikView |
Business intelligence application platform |
Free tier available |
|
Qubole |
Qubole Data Service |
Data engines for Hive, Spark, Hadoop, Pig, Cascading, Presto on AWS, Azure, Google Cloud |
Free tier available |
|
Rapid7 |
InsightOps |
Log management and analytics |
Available by request |
|
RapidMiner |
RapidMiner Studio |
Predictive analytics workflow and model builder |
Free tier available |
|
RapidMiner |
RapidMiner Radoop |
Predictive analytics on Hadoop and Spark with R and Python support |
Free tier available |
|
Red Hat |
Ceph |
Distributed object and block store and file system |
Open source |
|
RedPoint |
RedPoint Data Management |
Data management, quality, integration (also on Hadoop) |
Demo available by request |
|
SAP |
SAP HANA |
In-memory, column-oriented, relational DBMS (cloud or on-premise) with text search, analytics, stream processing, R integration, and graph processing |
Free tier available |
|
SAS |
SAS Platform |
Analytics, BI, data management, and deep statistical programming |
Available by request |
|
Sencha |
InfoVis Toolkit |
JavaScript visualization library |
Open source |
|
Sisense |
Sisense |
Analytics, BI, visualization, and reporting |
Available by request |
|
Skytree |
Skytree |
Machine learning platform with self-service options |
Available by request |
|
Software AG |
Terracotta In-Memory Data Management by Software AG |
In-memory data management, job scheduler, Ehcache implementation, and enterprise messaging |
Available by request |
|
Splunk |
Splunk Enterprise |
Operational intelligence for machine-generated data |
60 days |
|
Stitch |
Stitch |
ETL-as-a-service |
Free tier available |
|
StreamSets |
Dataflow Performance Manager |
Data management and analytics platform |
N/A |
|
Sumo Logic |
Sumo Logic |
Log and time-series management and analytics |
30 days |
|
Tableau |
Tableau |
Interactive data visualization for BI |
Available by request |
|
Tableau |
Tableau Desktop |
Visualization, analytics, exploration (with self-service, server, hosted options) |
14 days |
|
Talend |
Talend Data Fabric |
Real-time or batch data management platform |
N/A |
|
Talend |
Talend Open Studio |
ELT and ETL on Hadoop with open-source components |
N/A |
|
Tamr |
Tamr |
Data management, sanitation, analytics, and BI |
Demo available by request |
|
Targit |
Targit Decision Suite |
BI, analytics, discovery front-end with self-service options |
Demo available by request |
|
Teradata |
Teradata |
Data warehousing, analytics, data lake, SQL on Hadoop and Cassandra, big data appliances, R integration, and workload management |
N/A |
|
The R Foundation |
R |
Language and environment for statistical computing and graphics |
N/A |
|
Thoughtspot |
Thoughtspot |
Relational search engine |
Demo available by request |
|
TIBCO |
TIBCO Data Virtualization |
ETL, data virtualization, and integration platform |
N/A |
|
TIBCO |
Jaspersoft |
BI, analytics (OLAP, in-memory), ETL, data integration (relational and non-relational), reporting, and visualization |
Free tier available |
jaspersoft.com/business-intelligence-solutions |
TIBCO |
TIBCO Spotfire Platform |
Data mining and visualization |
30 days |
|
Treasure Data |
Treasure Data |
Analytics infrastructure as a service |
Demo available by request |
|
Trifacta |
Trifacta Wrangler |
Data wrangling, exploration, and visualization on Hadoop |
N/A |
|
University of Waikato |
Weka |
Machine learning and data mining for Java |
Open source |
|
Unravel |
Unravel |
Predictive analytics and machine learning performance monitoring |
Available by request |
unraveldata.com/optimize-troubleshoot-and-analyze-big-data-performance |
Waterline Data |
Waterline Data |
Data marketplace (inventory, catalogue with self-service) on Hadoop |
Demo available by request |
|
Wolfram |
Wolfram Language |
Knowledge-based programming language with many domain-specific libraries |
Available by request |
|
Workday |
Workday Prism Analytics |
Data preparation, discovery, and analytics on Hadoop and Spark |
N/A |
|
Xplenty |
Cascading |
Platform to develop big data applications on Hadoop |
Open source |
|