Treasure Data CDP Resources

  • Filter by Resource Type
  • Articles
  • Blog
  • Case Studies
  • Cheatsheets
  • Reports
  • Webinars
  • Filter by Industry
  • Automotive
  • CPG
  • Entertainment & Media
  • Financial Services
  • Healthcare
  • Retail
  • Technology
  • Travel & Hospitality
  • Filter by Topic
  • AI & Machine Learning
  • CDP
  • CDP Use Cases
  • Company News
  • Customer Data Strategy
  • Customer Service
  • Data Privacy & Security
  • Marketing
  • Partners
  • Treasure Data CDP

The 4 Important Things about Analyzing Data Part 1: The Importance of Providing Many ‘Obvious’ Results

In the past few years, mass accumulation of data has increased. Meanwhile, the distributed parallel processing of the data has matured. Although the original focus was on the analysis back end (platforms) to support the accumulation of data and the efficiency of the batches, I believe that the importance of the “data,” the “analysis,” and that of ... The 4 Important Things about Analyzing Data Part 1: The Importance of Providing Many ‘Obvious’ Results

Eliminating Schema Rot in MPP Databases Like Redshift

The MPP database is an incredible piece of technology. These databases run large-scale analytic queries very quickly, making them great tools for iterative data exploration. With a cloud offering like Redshift in the market, MPP databases are enjoying increasing adoption today outside of enterprise IT. However, like any other great technology, they excel in some ... Eliminating Schema Rot in MPP Databases Like Redshift

Managing the Data Pipeline with Git + Luigi

One of the common pains of managing data, especially for larger companies, is that a lot of data gets dirty (which you may or may not even notice!) and becomes scattered around everywhere. Many ad hoc scripts are running in different places, these scripts silently generate dirty data. Further, if and when a script results ... Managing the Data Pipeline with Git + Luigi

Learn SQL by Calculating Customer Lifetime Value Part 2: GROUP BY and JOIN

This is the second installment of our SQL tutorial blog series. In the first part, we set up the data source with SQLite and learned how to filter and sort data. This time, we will learn two other key concepts in SQL: GROUP BY and JOIN. Get the FREE e-book based on this blog series! ... Learn SQL by Calculating Customer Lifetime Value Part 2: GROUP BY and JOIN

Treasure Data Raises $15 Million in Series B Financing

With TD’s cloud service, customers can begin to leverage their largest data sources without significant investments in time, specialized skills or new infrastructure. An alternative to Hadoop platforms or services,...

Learn SQL by Calculating Customer Lifetime Value Part 1: Setup, Counting and Filtering

Motivation As far as technical skills go, SQL is a really nice skill to have for product managers and product marketers. Instead of constantly running into performance issues in Excel or begging “technical” people to look stuff up for you, you can get answers to your questions directly from data. Unfortunately, good, non-encyclopedic resources for ... Learn SQL by Calculating Customer Lifetime Value Part 1: Setup, Counting and Filtering

12 Open Source Software Innovations from Treasure Data Engineers

TD is proud to have some of the best technical minds in the world working on our unique managed service. When they’re not working on the TD Service or supporting our customers, many of our engineers continue to support technological innovation by...

How to Get More Clicks for Digital Advertising: Step by Step Guide to Optimizing CTRs with Real-time Data + Machine Learning

In the digital advertising space, optimizing the CTR (Click Through Rate) is one of the major challenges to increasing the performance of the advertising networks. Often times, machine learning algorithms are used to optimize what ads are relevant to the incoming visitors by learning from the historical impression and click logs. However, collecting and running machine ... How to Get More Clicks for Digital Advertising: Step by Step Guide to Optimizing CTRs with Real-time Data + Machine Learning

Amazon Recommends Fluentd as “Best Practice for Data Collection” over Flume and Scribe

This month, Parviz Deyham from Amazon Web Service promoted as the best data collection tool for Amazon Elastic MapReduce (EMR), a hosted Hadoop framework running on Amazon Elastic Compute Cloud (EC2) and Amazon Simple Storage Service (S3)...

Transform customer data into your most valuable business asset