Tag: Big data
-
Six principles for building robust yet flexible shared data applications
Posted on December 23, 2020, Level beginner Resource Length medium
Paul Done brought together a set of techniques he has identified to effectively deliver resilient yet evolvable data-driven applications.
Tags data-science big-data management cio
-
Getting started with distributed TensorFlow on GCP
Posted on December 22, 2020, Level beginner Resource Length medium
For many in the world of data science, distributed training can seem a daunting task. In addition to building and thoughtfully evaluating a high-quality ML model, you have to be aware of how to optimize your model for specific hardware and manage infrastructure. By Nikita Namjoshi.
Tags big-data data-science software gcp google
-
Modelling the time-of-arrival using distributions
Posted on December 21, 2020, Level beginner Resource Length medium
Estimating the time-of-arrival is a common problem in a wide range of settings, e.g. in logistics. This post will show a distribution-based approach that enables us to get more insights about arrival times and how we could use this information for decision-making in the logistics industry. By Jonas Laake.
Tags big-data data-science software
-
How to grid search deep learning models for time series forecasting
Posted on November 29, 2020, Level intermediate Resource Length medium
Grid searching is generally not an operation that we can perform with deep learning methods. This is because deep learning methods often require large amounts of data and large models, together resulting in models that take hours, days, or weeks to train. By Jason Brownlee.
Tags how-to machine-learning big-data data-science
-
Modern Distributed Data Architecture with Event Streams, Stream Processing and Derived Data
Posted on November 12, 2020, Level beginner Resource Length medium
Some of the most interesting projects I worked on at LinkedIn involved building large scale real-time pricing and machine learning products. They required crafting fault-tolerant distributed data architectures to support model training, forecasting and dynamic control systems. By Luthfur Chowdhury.
Tags cloud streaming software-architecture big-data cio data-science
-
Getting started with Python library Numpy
Posted on November 6, 2020, Level beginner Resource Length medium
NumPy is a open source Python library that handles multidimensional arrays and matrices with a huge library of mathematical functions to manipulate arrays. By Shahid Siddique.
Tags json big-data data-science python
-
How to run TensorFlow Lite models on Raspberry Pi
Posted on November 5, 2020, Level beginner Resource Length long
Ahmed Fawzy Gad published this piece about running machine learning on edge devices. The deep learning models created using TensorFlow require high processing capabilities to perform inference. Fortunately, there is a Lite version of TensorFlow called TensorFlow Lite (TFLite for short) which allows such models to run on devices with limited capabilities. Inference is performed in less than a second.
Tags machine-learning app-development robotics big-data data-science
-
How many jobs do robots really replace?
Posted on October 30, 2020, Level beginner Resource Length medium
New research puts a number on the job costs of automation. In many parts of the U.S., robots have been replacing workers over the last few decades. But to what extent, really? Some technologists have forecast that automation will lead to a future without work, while other observers have been more skeptical about such scenarios. By Massachusetts Institute of Technology, original written by Peter Dizikes.
Tags robotics big-data data-science teams career
-
Data-driven software architecture
Posted on October 29, 2020, Level beginner Resource Length short
The web continues to grow and expand in innovative ways. At the same time, few seem to notice that the way web applications are currently built is dragging us ever deeper into technical debt. Software-centric architecture and legacy technologies used to implement it are at odds with the potential of the web. At best they are wildly inefficient, and at worst they are heading us into a software apocalypse. By AtomGraph.
Tags software-architecture containers web-development app-development agile big-data
-
Setting Google Analytics on a Jekyll website and implementing cookie consent collect
Posted on October 27, 2020, Level beginner Resource Length long
It is very useful to have Google Analytics on your website to have an idea of how many (or how few) people are visiting. Using Google Analytics on a website requires a valid consent from visitors (Because of ePrivacy Directive and not the GDPR). By Coralie Collignon.
Tags analytics miscellaneous big-data web-development
-
As AI chips improve, is TOPS the best way to measure their power?
Posted on October 14, 2020, Level beginner Resource Length medium
About the challenge of evaluating AI chip performance using "TOPS", a metric that means trillions of operations per second, or "tera operations per second". By Jeremy Horwitz.
Tags big-data machine-learning data-science robotics performance
-
US begins $1 billion quantum computing plan to get ahead of 'adversaries'
Posted on August 30, 2020, Level beginner Resource Length medium
The government is funding basic research in AI, too. When big technologies like mobile phones, 5G networks and e-commerce arrive, it's important to get in on the ground floor. That's why the US government is establishing 12 new research centers, funded with hundreds of millions of dollars, to boost artificial intelligence and quantum computing. By Stephen Shankland @cnet.com.
Tags how-to cloud data-science machine-learning big-data software