Featured Post

8 Ways to Optimize AWS Glue Jobs in a Nutshell

Image
  Improving the performance of AWS Glue jobs involves several strategies that target different aspects of the ETL (Extract, Transform, Load) process. Here are some key practices. 1. Optimize Job Scripts Partitioning : Ensure your data is properly partitioned. Partitioning divides your data into manageable chunks, allowing parallel processing and reducing the amount of data scanned. Filtering : Apply pushdown predicates to filter data early in the ETL process, reducing the amount of data processed downstream. Compression : Use compressed file formats (e.g., Parquet, ORC) for your data sources and sinks. These formats not only reduce storage costs but also improve I/O performance. Optimize Transformations : Minimize the number of transformations and actions in your script. Combine transformations where possible and use DataFrame APIs which are optimized for performance. 2. Use Appropriate Data Formats Parquet and ORC : These columnar formats are efficient for storage and querying, signif

5 Tableau Features Useful for Data Analytics

Below are the top Tableau features for data analytics. Tableau 9 for Data Science engineers.

5 Useful Tableau Features for Data Analytics


CONNECTING TO LOCAL FILE 


Tableau can connect to any local file or database such as
  • Excel 
  • Text File
  • Access 
  • Statistical File, or 
  • Another Database file

CONNECTING TO SERVER

Tableau can connect to your data server too. It can connect to almost any type of data server.
Below are some of the most popular databases that Tableau can connect:
  • Tableau Server
  • Google Analytics
  • Google BigQuery
  • Hortonworks Hadoop Hive
  • MapR Hadoop Hive
  • IBM DB2
  • IBM BigInsights
  • IBM Netezza
  • Microsoft SQL Server
  • Microsoft Analysis Services
  • Oracle
  • Oracle Essbase
  • MySQL
  • PostgreSQL
  • SAP

While working on Tableau, data can have Live Connection where any change in the source data will be automatically updated in Tableau. On the other hand, data can be Extracted to the Tableau repository so that any change made here will not affect the original source data.

CONNECTING TO EXCEL FILE


To connect to an excel file, click “Excel” on the left hand side. Navigate to the file on your computer and double click to open it. For this tutorial, I will use a sample file that comes with the installation called “superstore”. You should open the appropriate file that you will be working with.

Now you are in the data connection window. It looks like the following Notice that I have three sheets in this file Orders, People, and Returns. I can simply drag the table I want. If I drag more than one table, Tableau automatically creates the join between the tables.

Creating charts


Based on the data we connected is easy. At the bottom of the page, Click on a sheet, Tableau automatically separates the data into Dimensions and Measures.

Dimensions are the categorical fields. These fields will create labels in the chart. Measures are the quantitative fields. These are the numbers we want to analyze. They create an axis in the chart. Sometimes, it might be confusing what type of chart should be used for specific data. Tableau has an interesting feature called “Show me”. “Show me” is the list of the possible charts that can be created using different combinations of data.

CREATING DASHBOARD


Tableau Dashboard contains all the related features intuitively interconnected to provide an interactive and real-time dashboard experience for non-technical users. To create a dashboard, click the “New Dashboard” icon at the bottom of the page. 

Comments

Popular posts from this blog

How to Fix datetime Import Error in Python Quickly

How to Check Kafka Available Brokers

SQL Query: 3 Methods for Calculating Cumulative SUM