Articles
Abstract
: Nowadays, due to the flow of various types of data, there is an
appearance of data in large quantities. These data are difficult to use, analyze and
predict. The article presents information and technologies for processing large
amounts of data: Apache Hadoop, Atlas.ti, HPCC, Storm, Qubole Data, Apache
Cassandra, Stats iQ from Qualtrics, CouchDB, Pentaho, Apache Flink, Cloudera,
Open Refine, RapidMiner, data cleaner