Posts

Showing posts with the label tensorflow

Featured Post

8 Ways to Optimize AWS Glue Jobs in a Nutshell

Image
  Improving the performance of AWS Glue jobs involves several strategies that target different aspects of the ETL (Extract, Transform, Load) process. Here are some key practices. 1. Optimize Job Scripts Partitioning : Ensure your data is properly partitioned. Partitioning divides your data into manageable chunks, allowing parallel processing and reducing the amount of data scanned. Filtering : Apply pushdown predicates to filter data early in the ETL process, reducing the amount of data processed downstream. Compression : Use compressed file formats (e.g., Parquet, ORC) for your data sources and sinks. These formats not only reduce storage costs but also improve I/O performance. Optimize Transformations : Minimize the number of transformations and actions in your script. Combine transformations where possible and use DataFrame APIs which are optimized for performance. 2. Use Appropriate Data Formats Parquet and ORC : These columnar formats are efficient for storage and querying, signif

The Growth of Machine Learning till TensorFlow

Image
The Internet and the vast amount of data are inspirations for CEOs of big corporations to start to use Machine learning. It is to provide a better experience to users. How TensorFlow Starts Let us take Amazon, online retail that uses Machine learning. The algorithm's purpose is to generate revenue. Based on user search data, the ML application provides information or insights. The other example is the advertising platform where Google is a leader in this line. Where it shows ads based on the user movements while surfing the web. These are just a few, but there are many in reality. Machine Learning Evolution Top ML Frameworks Torch The torch is the first framework developed in 2002 by Ronan Collobert. Initially, IBM and Facebook have shown much interest. The interface language is Lua. The primary focus is matrix calculations. It is suitable for developing neural networks. Theano It is developed in 2010 by the University of Montreal. It is highly reliable to process graphs (GPU). The