@cerndb

CERN Database and Analytics Group

Popular repositories

  1. dist-keras Public archive

    Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.

    Python 623 170

  2. spark-dashboard Public

    Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Apache Spark Performance Dashboard using container technology.

    Dockerfile 99 21

  3. SparkPlugins Public

    Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are initialized. This also allows extending the Spark metrics syst…

    Scala 79 15

  4. hdfs-metadata Public

    Tool for gathering block and replica metadata from HDFS. It also builds a heat map showing how replicas are distributed across disks and nodes.

    Java 56 19

  5. SparkDLTrigger Public

    Code and links to the data for the article "Machine Learning Pipelines with Modern Big Data Tools for High Energy Physics"

    Jupyter Notebook 29 13

  6. Hadoop-Profiler Public

    Hadoop Profiler, or hprofiler, is a tool for analyzing on-CPU and off-CPU workloads in distributed computing environments.

    Shell 24 10

Repositories

Showing 10 of 66 repositories
  • SparkDLTrigger Public

    Code and links to the data for the article "Machine Learning Pipelines with Modern Big Data Tools for High Energy Physics"

    Jupyter Notebook 29 Apache-2.0 13 0 0 Updated Jun 11, 2024
  • opentelemetry-collector-contrib Public Forked from open-telemetry/opentelemetry-collector-contrib

    Contrib repository for the OpenTelemetry Collector

    Go 0 Apache-2.0 2,148 0 0 Updated May 29, 2024
  • argo-helm Public Forked from argoproj/argo-helm

    ArgoProj Helm Charts

    Mustache 0 Apache-2.0 1,817 0 0 Updated May 28, 2024
  • SparkTraining Public

    Material for the course "Introduction to Apache Spark APIs for Data Processing" https://sparktraining.web.cern.ch/

    Jupyter Notebook 9 CC-BY-4.0 5 0 0 Updated May 23, 2024
  • NotebooksExamples Public

    This repository contains Jupyter notebook examples, intended to be linked with the SWAN Gallery

    Jupyter Notebook 1 Apache-2.0 0 0 0 Updated May 16, 2024
  • spark-dashboard Public

    Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Apache Spark Performance Dashboard using container technology.

    Dockerfile 99 Apache-2.0 21 1 0 Updated May 16, 2024
  • SparkPlugins Public

    Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are initialized. This also allows extending the Spark metrics system with user-provided monitoring probes.

    Scala 79 Apache-2.0 15 3 0 Updated Apr 2, 2024
  • sparkMeasure Public

    This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics.

    Scala 14 Apache-2.0 3 0 0 Updated Mar 11, 2024
  • jdbc-connector-for-apache-kafka Public Forked from Aiven-Open/jdbc-connector-for-apache-kafka

    Aiven's JDBC Sink and Source Connectors for Apache Kafka®

    Java 0 Apache-2.0 56 0 0 Updated Nov 8, 2023
  • zkpolicy Public

    Zookeeper Policy Audit Tool (aka zkPolicy) for checking and enforcing ACLs on ZNodes.

    Java 7 MIT 1 1 0 Updated Oct 25, 2023