Treasure Data CDP Resources

  • Filter by Resource Type
  • Articles
  • Blog
  • Case Studies
  • Cheatsheets
  • Events
  • Reports
  • Webinars
  • Filter by Industry
  • Automotive
  • CPG
  • Entertainment & Media
  • Financial Services
  • Healthcare
  • Retail
  • Technology
  • Travel & Hospitality
  • Filter by Topic
  • AI & Machine Learning
  • CDP
  • CDP Use Cases
  • Company News
  • Customer Data Strategy
  • Customer Service
  • Data Privacy & Security
  • Marketing
  • Partners
  • Treasure Data CDP

Making Magic with pandas-td

Magic functions enable common tasks by saving you typing. (NOTE:  Pandas itself doesn’t have magic functions; the IPython kernel does.) Magic functions are functions preceeded by a % symbol. Magic functions have been introduced into pandas-td version 0.8.0!  Toru Takahashi from Treasure Data walks us through. Treasure Data’s magic functions work by wrapping a separate ... Making Magic with pandas-td

Collecting All Docker Logs with Fluentd

Just in case you have been offline for the last two years, Docker is an open platform for distributed apps for developers and sysadmins. By turning your software into containers, Docker lets cross-functional teams ship and run apps across platforms seamlessly...

Data Science 101: Interactive Analysis with Jupyter, Pandas and Treasure Data

TD gives you a cloud-based analytics infrastructure accessible via SQL. Our interactive engines like Presto give you the power to crunch billions of records with ease. As a data scientist, you’ll still need to learn how to write basic SQL queries...

Python 101 for Aspiring Data Nerds

As a data scientist, or anyone interested in collecting data for that matter, it’s no doubt helpful to know about how to go about collecting the data in your app – data that you’ll want to later query and analyze. Here, we’ll build an app in Python from A-Z,  iterate on it to make it ... Python 101 for Aspiring Data Nerds

New UDFs in Presto: currency conversion and geocoding tools

Today, we introduced three new UDFs (user-defined functions) to to TD‘s Presto offering. They are...

The 4 Important Things About Analyzing Data Part 2: Understand the Purpose of the Analysis and Who Needs the Results

Before analyzing data, it is important to first clearly understand for whom and for what purpose you are conducting the analysis. This is essential because analytics assist humans in making decisions. Therefore, conducting the analysis to produce the best results for the decisions to be made is an important part of the process, as is ... The 4 Important Things About Analyzing Data Part 2: Understand the Purpose of the Analysis and Who Needs the Results

Why the Unified Logging Layer Matters

The amount of logs produced today is staggering. The logs provide opportunities for analysis to better understand customers and continually improve products. The log collection pipeline, then, becomes a source of valuable data. Collecting and unifying the data for better consumption and analysis can be a challenge. It is important to understand the nuances of ... Why the Unified Logging Layer Matters

Treasure Data Joins the Linux Foundation

Today is a big step forward for our customers and community in general, as we officially join the Linux Foundation. As you may know, our company is driven by an open source culture: We believe that continuous innovation,...

Presto versus Hive: What You Need to Know

There is much discussion in the industry about analytic engines and, specifically, which engines best meet various analytic needs. This post looks at two popular engines, Hive and Presto, and assesses the best uses for each. How Hive Works Hive translates SQL queries into multiple stages of MapReduce and it is powerful enough to handle ... Presto versus Hive: What You Need to Know

Transform customer data into your most valuable business asset