Tag: Data science
-
PyTorch on Google Cloud: How to train PyTorch models on AI Platform
Posted on September 23, 2022, Level intermediate Resource Length long
PyTorch has been predominantly used in research and in recent years it has gained tremendous traction in the industry as well due to its ease of use and deployment. By Rajesh Thallam, Vaibhav Singh.
Tags gcp google data-science python json
-
How to run a Stable Diffusion server on Google Cloud Platform (GCP)
Posted on September 22, 2022, Level intermediate Resource Length medium
Open-sourced alternative to OpenAI's gated DALLĀ·E 2 with comparable quality, Stable Diffusion offers something to everyone: end-users can generate images virtually for free, developers can embed the model into their service, ML engineers can investigate and modify the code, and researchers have full leeway to push the state of the art even further. By Iulia Turc.
Tags gcp devops data-science open-source google
-
Deep dive: How AI content generators work
Posted on September 11, 2022, Level beginner Resource Length medium
Artificial intelligence (AI) has been steadily influencing business processes, automating repetitive and mundane tasks even for complex industries like construction and medicine. By Victor Dey.
Tags big-data machine-learning data-science software-architecture
-
Uber freight carrier metrics with near-real-time analytics
Posted on September 7, 2022, Level beginner Resource Length long
Uber Freight has been around since 2016 and is dedicated to provide a platform to seamlessly connect shippers with carriers. We're simplifying the lives of trucking companies by providing a platform for carriers to browse through all available shipment opportunities with upfront pricing and book with the tap of a button, and making the fulfillment process more scalable and efficient. By Ujwala Tulshigiri, Yeqing Lu, Ting Chen, Branden Colen.
Tags data-science apache event-driven messaging distributed devops
-
Why you absolutely must document your no-code software
Posted on September 4, 2022, Level beginner Resource Length medium
Here's a conceptual framework to help keep your no-code and low-code effort simple, but comprehensive. By Joe Procopio.
Tags software-architecture big-data bots programming data-science devops
-
How to build low-code mixed-reality apps in Power Apps, AI Builder and HoloLens
Posted on September 3, 2022, Level beginner Resource Length medium
Off-the-shelf tools and components like Power Apps, AI Builder and HoloLens let you deliver your own industrial metaverse at a low cost. Microsoft's low-code Power Platform is a key piece of its business software suite. By sitting between Microsoft 365's productivity tools and the Dynamics 365 line-of-business applications, it's a way to build out custom workflows and add your own user experiences. By Simon Bisson.
Tags software-architecture big-data bots management data-science software
-
Delta Lake and MinIO for multi-cloud data lakes
Posted on August 31, 2022, Level beginner Resource Length medium
Delta Lake is an open-source storage framework that is used to build data lakes on top of object storage in a Lakehouse architecture. Delta Lake supports ACID transactions, scalable metadata handling and unified streaming and batch data processing. By Matt Sarrel.
Tags cio big-data cloud database data-science
-
AI Ethics framework
Posted on August 23, 2022, Level beginner Resource Length long
Learn about AI ethics, a framework that guides data scientists and researchers to build AI systems in an ethical manner to benefit society as a whole. By @IBM.
Tags big-data cio data-science frameworks miscellaneous
-
Introduction to Apache Spark and its Datasets
Posted on August 15, 2022, Level intermediate Resource Length medium
In this article, we will introduce you to the big data ecosystem and the role of Apache Spark in Big data. We will also cover the Distributed database system, the backbone of big data. In today's world, data is the fuel. Almost every electronic device collects data that is used for business purposes. By Abhishek Jaiswal.
Tags big-data data-science database miscellaneous
-
Expanding the CAP tradeoff frontier at scale
Posted on August 14, 2022, Level advanced Resource Length medium
Distributed systems must balance their needs for high availability and low latency with consistency guarantees; providing a mostly hit happy path for requests enables these systems to push the boundaries of this tradeoff. By Audrey Cheng.
Tags database big-data data-science performance devops
-
Benchmarking time series workloads on Apache Kudu using TSBS
Posted on August 13, 2022, Level intermediate Resource Length medium
Since the open-source introduction of Apache Kudu in 2015, it has billed itself as storage for fast analytics on fast data. This general mission encompasses many different workloads, but one of the fastest-growing use cases is that of time-series analytics. By Todd Lipcon.
Tags analytics big-data data-science performance devops
-
How I use stories to share data at meetings
Posted on July 13, 2022, Level beginner Resource Length long
Want to keep people awake at data presentations? Try sharing your findings backwards. You've worked it. Now put your chart down, flip it and reverse it. By Ryan Estrellado.
Tags miscellaneous cio agile data-science management