Filter by products, topics, and types of content

(6 Products)

(43 Topics)

(12 Industries)

(6 Types)

1 - 30 of 30 results
Show Summaries | Hide Summaries
View Results
Title none Type none Date down
An introduction to Kafka
An introduction to one of the most popular platforms for distributed messaging or streaming data.
Also available in: Chinese   Japanese  
Articles 05 Jun 2018
Code pattern: Mine insights from software development artifacts
There is a lot of unstructured text content that is generated in any domain – software development lifecycle, finance, healthcare, social media, etc. Valuable insights can be generated by analyzing unstructured text content and correlating the information across various document sources. This pattern uses Watson Natural Language Understanding, Python Natural Language Toolkit, OrientDB, Node-RED, and IBM Data Science Experience to build a complete analytics solution that generates insights for informed decision-making.
Articles 24 Jan 2018
Social power, influence, and performance in the NBA, Part 2: Exploring the individual NBA players
In this tutorial series, learn how to analyze how social media affects the NBA using Python, pandas, Jupyter Notebooks, and a touch of R. Part 2 explores individual athletes in the NBA: endorsement data, true on-the-court performance, and social power with Twitter and Wikipedia.
Also available in: Chinese   Japanese   Portuguese  
Articles 06 Sep 2017
Social power, influence, and performance in the NBA, Part 1: Explore valuation and attendance using data science and machine learning
In this tutorial series, learn how to analyze how social media affects the NBA using Python, pandas, Jupyter Notebooks, and a touch of R. Here in Part 1, learn the basics of data science and machine learning around the teams in the NBA.
Also available in: Chinese   Japanese   Portuguese  
Articles 31 Aug 2017
Need a recommendation engine? Graph databases boost customer service with real-time insight
Instead of thinking in terms of business-to-business (B2B) or business-to-consumer (B2C), the new paradigm is business-to-individual (B2I): you must know your customer, and tailor your services to their personal needs and preferences. But how do enterprises enable this paradigm shift in their approach to customer engagement? The answer is a game-changing piece of technology: the recommendation engine.
Blog 21 Feb 2017
Real-time multi-tenant migration with Cloudant NoSQL database
Relational SQL-based database systems can perform queries to alter a table or database, or run update a number of rows at a time. Cloudant and other NoSQL databases are schema-less. Thus, a migration script is essential in performing bulk operations. This blog provides tips on writing migration scripts and performing real-time tenant data migration without downtime.
Blog 21 Feb 2017
Navigating the world of modern data stores and NoSQL
Ever since the internet blew up the world of enterprise data and application silos in the mid-1990s, software engineers have been continuously challenged to integrate web solutions and platforms that were never meant to work together. Now 20 years later, those challenges are amplified by the expectations of an information-hungry world that demands ever-more intuitive interactions with the devices, data, information and applications they work with and play with every day.
Blog 21 Feb 2017
No more joins: An overview of Graph database query languages Alaa Mahmoud
Once you have your mind set on a particular graph database, the next question will be, "Which query language should I use?" Unlike SQL databases where you pretty much have only one choice, graph databases have numerous query languages, each of which is trying to solve a particular problem.
Blog 21 Feb 2017
Detecting complex fraud in real time with Graph databases
Graph databases such as IBM Graph can help prevent losses before they happen and limit the impact of fraud on your bottom line. In a graph database, both records and relationships are first-class citizens, which means that you can accomplish complex traversals from one record to another very quickly.
Blog 21 Feb 2017
Start your data science education with the Data Science Fundamentals Learning Path
This article describes a short, straightforward learning path to begin building your data science skills. The recently launched Data Science Fundamentals Learning Path at Big Data University guides you through no-charge online courses that prepare you to earn your IBM Data Science Foundations Level 1 and Level 2 badges.
Also available in: Portuguese   Spanish  
Articles 29 Nov 2016
Use IBM SPSS Statistics for business intelligence
IBM SPSS Statistics is an effective tool for Business Intelligence. This series of articles demonstrates the use of SPSS Statistics to extract information from raw data, identify techniques and strategies to enhance business performance, and forecast effects of dynamic market conditions. This first article in the series introduces sample raw data and generates frequency tables, graphs, crosstabs, and statistics. Business-level conclusions from statistical analysis are also explored in this article.
Also available in: Chinese  
Articles 17 Aug 2015
Perform sentiment analysis in a big data environment
Public opinion views about government policies are scattered across the Internet, in Twitter and News Feeds. People can express their views quickly and easily from mobile devices, which are ubiquitous. Sentiment analysis can be performed against the data that is gathered from these disparate sources (tweets, RSS feeds, and mobile apps). This data can be aggregated, transformed, or reformatted and then stored in a Hadoop Distributed File System (HDFS). This data can be processed and analyzed as data at rest (data lake) or as data in motion (data stream).
Also available in: Chinese   Russian   Japanese  
Articles 23 Jul 2015
TED Talk replay: What do we do with all this big data?
In this TED Talk replay, Susan Etlinger explains why, as we receive more and more data, we need to deepen our critical thinking skills. Because it's hard to move beyond counting things to really understanding them.
Videos 28 Oct 2014
Integrate the BIRT Viewer with Java and Java EE web applications
Learn to integrate the Business Intelligence and Reporting Tools (BIRT) Report viewer with a custom Java and Java Platform, Enterprise Edition (Java EE) web application. Also, use the features of the BIRT report viewer application in custom web applications.
Also available in: Chinese   Japanese  
Tutorial 02 Sep 2014
IBM Cognos Business Intelligence
Download IBM Cognos Business Intelligence Developer Edition V10.2.1. IBM Cognos BI Developer Edition enables you to learn a rich set of business intelligence capabilities without incurring any upfront costs. Access any and all data sources and use reporting and analysis to experiment with how to deliver relevant information how, when, and where it is needed. Broaden your skills and learn how to build BI applications based on enterprise-class SOA foundation. Expand your opportunities in BI and become an IBM partner.
Also available in: Chinese  
Trial Downloads 02 Jul 2014
Using IBM Big SQL over HBase, Part 1: Creating tables and loading data
With IBM's Big SQL technology, you can use InfoSphere BigInsights to query HBase using industry-standard SQL. This two-part series focuses on creating tables, data-loading methods, and query handling. Here in Part 1, learn fundamental usage of IBM's Big SQL technology for Hadoop over HBase by creating tables and examining ways to load data. Follow a basic storyline of migrating a relational table to HBase using Big SQL. Part 2 explores query handling, and how to connect to Big SQL via JDBC to run business intelligence and reporting tools, such as BIRT and Cognos.
Also available in: Chinese  
Articles 18 Feb 2014
Process small, compressed files in Hadoop using CombineFileInputFormat
This article provides detailed examples that show you how to extend and implement CombineFileInputFormat to read the content of gzip (default codec) files at runtime. Learn how to use CombineFileInputFormat within the MapReduce framework to decouple the amount of data a Mapper consumes from the block size of the files in HDFS.
Also available in: Chinese   Japanese  
Articles 11 Feb 2014
IBM Entrepreneur Week
IBM Entrepreneur Week is a one-of-a-kind opportunity for you to meet, interact, and connect with entrepreneurs, venture capitalists, industry leaders, and academics from around the world. If you're a startup or entrepreneur, join us online for our inaugural IBM Entrepreneur Week, 3-7 Feb 2014. There will be events taking place online and in locations worldwide, including face-to-face and virtual mentoring sessions, a women entrepreneur-focused event, and a LiveStream broadcast of the SmartCamp Global Finals in San Francisco.
Articles 15 Jan 2014
Oozie workflow scheduler for Hadoop
Big data in its raw form rarely satisfies the Hadoop developer's data requirements for performing data processing tasks. Different extract/transform/load (ETL) and pre-processing operations are usually needed before starting any actual processing jobs. Oozie is a framework that helps automate this process and codify this work into repeatable units or workflows that can be reused over time without the need to write any new code or steps. Learn how Oozie can be used to create different types of workflows.
Also available in: Russian   Japanese  
Articles 19 Nov 2013
Big data serialization using Apache Avro with Hadoop
Apache Avro is a serialization framework that produces data in a compact binary format that doesn't require proxy objects or code generation. Get to know Avro, and learn how to use it with Apache Hadoop.
Also available in: Russian  
Articles 29 Oct 2013
Big data architecture and patterns, Part 2: How to know if a big data solution is right for your organization
This article describes a dimensions-based approach for assessing the viability of a big data solution. By answering questions that explore each dimension, apply what you know about your own environment to determine whether a big data solution is appropriate. A careful look at each dimension yields clues about whether it's time for your big data services to evolve.
Also available in: Chinese   Russian   Portuguese   Spanish  
Articles 08 Oct 2013
Optimization in R
Many statistical techniques involve optimization. The path from a set of data to a statistical estimate often lies through a patch of code whose purpose is to find the minimum (or maximum) of a function. Likelihood-based methods (such as structural equation modeling, or logistic regression) and least squares estimates all depend on optimizers for their estimates and for certain goodness-of-fit tests. Base-R offers the optim function for general-purpose optimization. Through a conversation with John Nash, author and maintainer of optim and the newer optimx, learn about the pitfalls of optimization and some of the tools that R offers.
Articles 07 Oct 2013
Build a data warehouse with Hive
The data warehouse has been an ongoing battle among organizations for years. How do you build it? What data can you integrate? Should you use Kimball or Inmon, corporate information factory (CIF), or data marts? The list could go on for days -- decades, even. With big data, the questions become far more complicated, such as is a data warehouse enough? The answer lies in the enterprise. People claim that Hive is the data warehouse of Hadoop. Although true on one level, it's also something of a false claim. Sometimes, however, you have to use the tools available to you, and for that, Hive can be a data warehouse.
Also available in: Chinese   Russian  
Articles 25 Jun 2013
What's the big deal about Big SQL?
If you specialize in relational database management technology, you've probably heard a lot about "big data" and the open source Apache Hadoop project. Perhaps you've also heard about IBM's new Big SQL technology, which enables InfoSphere BigInsights users to query Hadoop data using industry-standard SQL. Curious? This article introduces you to Big SQL, answering many of the common questions that relational DBMS users have about this IBM technology.
Also available in: Chinese   Russian  
Articles 14 Jun 2013
InfoSphere Streams
Download a free trial version of InfoSphere Streams, a high-performance analytics platform that allows user-developed applications to rapidly ingest, analyze, and correlate information as it arrives from thousands of real-time sources.
Also available in: Chinese   Portuguese  
Trial Downloads 18 Apr 2013
B2B customer segmentation
Learn the process of developing a business-to-business customer segmentation, including the challenges of segmenting business customers and important differences from more common consumer segmentations. Consider methodologies and suggestions on how to work closely with business users on implementation.
Also available in: Spanish  
Articles 30 Oct 2012
SPSS Text Analytics for Surveys
Download a free version of SPSS Text Analytics for Surveys, uses powerful natural language processing technologies specifically designed for survey text.
Also available in: Chinese   Portuguese  
Trial Downloads 11 Aug 2011
Migrating InfoSphere Streams SPADE applications to Streams Processing Language, Part 1: Migrate basic SPADE applications
The most significant new feature of Version 2.0 of the IBM InfoSphere(R) Streams product is the programming language model transformation from Streams Processing Application Declarative Engine (SPADE) to Streams Processing Language (SPL). Users with SPADE applications from previous versions will need to migrate and port their applications to SPL when upgrading their installations to Version 2.0. This tutorial is Part 1 of a 5-part series that uses actual SPADE samples to demonstrate a series of step-by-step procedures for migrating and porting different types of SPADE application content. Part 1 demonstrates the migration of basic SPADE applications.
Tutorial 19 May 2011
An introduction to the Hadoop Distributed File System
The Hadoop Distributed File System (HDFS)--a subproject of the Apache Hadoop project--is a distributed, highly fault-tolerant file system designed to run on low-cost commodity hardware. HDFS provides high-throughput access to application data and is suitable for applications with large data sets. This article explores the primary features of HDFS and provides a high-level view of the HDFS architecture.
Also available in: Chinese   Japanese   Portuguese  
Articles 01 Feb 2011
What is PMML?
The Predictive Model Markup Language (PMML) is the de facto standard language used to represent predictive analytic models. It allows for predictive solutions to be easily shared between PMML compliant applications. With predictive analytics, the Petroleum and Chemical industries create solutions to predict machinery break-down and ensure safety. PMML is supported by many of the top statistical tools. As a result, the process of putting a predictive analytics model to work is straightforward since you can build it in one tool and instantly deploy it in another. In a world in which sensors and data gathering are becoming more and more pervasive, predictive analytics and standards such as PMML make it possible for people to benefit from smart solutions that will truly revolutionize their lives.
Also available in: Chinese   Portuguese   Portuguese   Spanish  
Articles 28 Sep 2010
1 - 30 of 30 results
Show Summaries | Hide Summaries