Tag: Apache
-
Apache web server hardening and security guide
Posted on April 21, 2024, Level intermediate Resource Length medium
The Web Server is a crucial part of web-based applications. Apache Web Server is often placed at the edge of the network; hence it becomes one of the most vulnerable services to attack. A practical guide to secure and harden Apache HTTP Server. By Chandan Kumar.
Tags apache web-development cloud software-architecture infosec
-
Introducing WAP pattern support with Apache Iceberg
Posted on February 3, 2024, Level intermediate Resource Length long
If you're using SQLMesh alongside Apache Spark and Apache Iceberg, I have some exciting news for you! Starting from version 0.57.0, SQLMesh applies the Write-Audit-Publish (WAP) pattern when executing models using Apache Spark and the Apache Iceberg data format. The best part? No user action is required to enable this behavior - it's enabled by default. By Iaroslav Zeigerman.
Tags analytics big-data app-development apache devops
-
Apache ZooKeeper: The perfect tool for configuration management
Posted on December 17, 2023, Level beginner Resource Length medium
Apache ZooKeeper is an open-source distributed coordination system that provides a platform for configuration management, process synchronization, and lock management. Originally developed by Yahoo, it is now maintained by the Apache Software Foundation. By datascientest.com.
Tags event-driven software-architecture management devops apache
-
Web server load balancing: Techniques and best practices
Posted on August 25, 2023, Level beginner Resource Length medium
Unveiling Dart 3.1: A New Horizon for Functional Programming in Flutter Companies across the globe seek fast system performance and quick responses when it comes to websites and modern applications. Often such high traffic websites must cater to millions of requests from end users as well as clients simultaneously. In such scenarios, a single server may not be able to handle the network traffic. By Hitesh Jethva.
Tags web-development app-development servers apache nginx
-
How to enable HSTS for enhanced web security in Apache
Posted on May 13, 2023, Level intermediate Resource Length medium
HTTP Strict Transport Security (HSTS) is a web security policy mechanism that helps to protect websites against protocol downgrade attacks and cookie hijacking. It allows web servers to declare that web browsers (or other complying user agents) should interact with it using only secure HTTPS connections, and never via the insecure HTTP protocol. This article will guide you on how to implement and optimize HSTS in Apache for improved web security. By Rahul.
Tags app-development infosec web-development apache ssl
-
Simplified data pipelines with Pulsar transformation functions
Posted on April 24, 2023, Level intermediate Resource Length medium
They provide a low-code way to develop basic processing and routing of data using existing Pulsar features. Using functions in the cloud is a very efficient way of creating iterable workflows that can transform data, analyze source code, make platform configurations, and do many other useful jobs. As you develop a function you will quickly realize a need for a solid foundation of utilities and formatting. By Christophe Bornet.
Tags app-development data-science apache big-data
-
Comparing Avro vs Protobuf for data serialization
Posted on April 18, 2023, Level beginner Resource Length short
Data serialization is a crucial aspect of modern distributed systems because it enables the efficient communication and storage of structured data. In this article, we will discuss two popular serialization formats: Avro and Protocol Buffers, Protobuf for short, and compare their strengths and weaknesses to help you make an informed decision about which one to use in your projects. By Daniel Selans.
Tags json queues messaging app-development streaming apache
-
Using Vulcan codecs with Kafka Java APIs
Posted on April 17, 2023, Level intermediate Resource Length medium
For those that aren't familiar, Vulcan is a functional Avro encoding library that uses the official Apache Avro library under the hood. The difference between this and the official Avro build plugins approach is that the types are defined in plain Scala. Then the Avro schema is generated from those instead of defining the Avro schema and getting code generated at compile time that adheres to that schema. By César Enrique.
Tags apache java messaging app-development streaming scala
-
Real-time data linkage via Linked Data Event Streams
Posted on April 12, 2023, Level intermediate Resource Length long
Real-time interchanging data across domains and applications is challenging; data format incompatibility, latency and outdated data sets, quality issues, and lack of metadata and context. A Linked Data Event Stream (LDES) is a new data publishing approach which allows you to publish any dataset as a collection of immutable objects. The focus of an LDES is to allow clients to replicate the history of a dataset and efficiently synchronize with its latest changes. By towardsai.net.
Tags data-science streaming performance how-to big-data apache
-
Deploy Apache Flink cluster on Kubernetes
Posted on March 11, 2023, Level intermediate Resource Length medium
When it comes to deploying Apache Flink on Kubernetes, you can do it in two modes, either session cluster or job cluster. A session cluster is a running standalone cluster that can run multiple jobs, while a Job cluster deploys a dedicated cluster for each job. By Elvis David.
Tags apache devops cloud data-science big-data
-
How to orchestrate an ETL Data Pipeline with Apache Airflow
Posted on March 10, 2023, Level intermediate Resource Length medium
Data Orchestration involves using different tools and technologies together to extract, transform, and load (ETL) data from multiple sources into a central repository. By Aviator Ifeanyichukwu.
Tags apache database nosql data-science python big-data
-
Using Apache Kafka to process 1 trillion inter-service messages
Posted on January 27, 2023, Level intermediate Resource Length long
Cloudflare has been using Kafka in production since 2014. We have come a long way since then, and currently run 14 distinct Kafka clusters, across multiple data centers, with roughly 330 nodes. Between them, over a trillion messages have been processed over the last eight years. By Matt Boyle.
Tags event-driven apache apis app-development database