Sentiment Analysis Media monitoring and text analysis are automated methods for retrieving insights from large unstructured or semi-structured data, such as transaction data, social media, blogs, and web news sites. …
Machine Learning
Machine Learning explores the construction and study of learning algorithms. It can be viewed as a union of unsupervised learning methods, more commonly referred to as data mining, and supervised …
Big Data Characterized in terms of V’s
Volume: Refers to the amount of data. Big Data often has thousands of entities or elements in billions of records. Velocity: Refers to the speed at which data is captured, …
Data Science – Process and Iterative Phases
The Data Science process follows the scientific method of refining knowledge by making observations, formulating and testing hypotheses, observing results, and formulating general theories that explain results. Within Data Science, …
Data Science – Dependency
Developing Data Science solutions involves the iterative inclusion of data sources into models that develop insights. Data Science depends on: Rich Data Sources: Data with the potential to show otherwise …
Big Data and Data Science – A Glance
Big Data is produced through email, social media, online orders, and even online video games. Data is generated not only by phones and point-of-sale devices, but also by surveillance systems, …
Data Quality – Policy and Metrics
Data Quality – Policy Data Quality efforts should be supported by and should support data governance policies. For example, governance policies can authorize periodic quality audits and mandate compliance to …
Data Quality Program – Readiness Assessment / Risk Assessment
Findings from a readiness assessment will help determine where to start and how quickly to proceed. Findings can also provide the basis for roadmapping program goals. If there is strong …
Data Quality – Audit Code Module and Metrics
Quality Check and Audit Code Modules Create shareable, linkable, and re-usable code modules that execute repeated data quality checks and audit processes that developers can get from a library. If …
Data Quality – Preventive and Corrective Actions
Preventive Actions The best way to create high quality data is to prevent poor quality data from entering an organization. Preventive actions stop known errors from occurring. Inspecting data after …