Overview of NRP and the Nautilus Cluster

The National Research Platform (NRP) is a partnership of more than 50 institutions, led by researchers at UC San Diego, the University of Nebraska-Lincoln, and the Massachusetts Green High Performance Computing Center. It is supported by the National Science Foundation (NSF), the Department of Energy (DOE), and the Department of Defense (DoD), among others. The NRP is a community-owned research and education platform whose mission is to connect researchers and educators, foster collaboration, accelerate innovation, and share resources. It provides access to AI, high-performance computing, data storage, and networking technologies, free of charge to non-profit research and education institutions. As of early 2024, the NRP connects over 400 nodes across 70+ locations on 3 continents and serves over 5,000 users.

The NRP Nautilus Cluster is a key component of this national infrastructure. Nautilus is a HyperCluster designed for running containerized Big Data and AI applications. It uses Kubernetes (K8s) to manage applications and Rook to automate Ceph data services. Nautilus offers a diverse range of computational resources, including CPUs, GPUs (from consumer-grade cards to high-end AI accelerators such as NVIDIA A100s and H100s), FPGAs, and a federated national-scale CDN.

Data Policy: The NRP Nautilus Cluster, like all NRP resources, currently has no storage suitable for HIPAA, PID, FISMA, FERPA, or protected data of any kind. Users are not permitted to store such data on NRP machines.

Getting Started with Nautilus

For UCR-specific assistance or initial inquiries, please contact:

For general NRP support and documentation, refer to the following resources:

Using the Nautilus Cluster

This section provides links to essential documentation for leveraging the capabilities of the Nautilus Cluster.

Core Concepts & Tutorials

Running Computational Tasks

Storage on Nautilus

Nautilus offers a variety of storage solutions to meet different research needs.

  • Storage Options Overview: An introduction to the available storage types.
  • Ceph FileSystem (CephFS): Provides home directories and project spaces. Accessed through a Kubernetes PersistentVolumeClaim.
  • Ceph S3 Object Storage: Scalable object storage compatible with the S3 API.
  • CVMFS (CernVM File System): Used for distributing software and data efficiently.
  • Local Scratch Storage: Fast, temporary storage available on worker nodes. Data does not persist after the workload ends.
  • Data Management:
  • For other storage solutions like Linstor or Nextcloud, refer to the main Nautilus Storage Documentation.
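
As a rough illustration of the CephFS and local scratch bullets above, the following Python sketch builds two Kubernetes manifests: a PersistentVolumeClaim, and a pod that mounts both that claim and a temporary emptyDir volume. The storage class name `rook-cephfs`, the object names, the image, and the mount paths are assumptions chosen for the example, not values confirmed by this page; check the Nautilus storage documentation for the classes actually offered.

```python
import json


def cephfs_pvc(name, size="10Gi", storage_class="rook-cephfs"):
    """Build a PersistentVolumeClaim manifest for a shared volume.

    The default storage class is an assumption, not confirmed by this page.
    """
    return {
        "apiVersion": "v1",
        "kind": "PersistentVolumeClaim",
        "metadata": {"name": name},
        "spec": {
            "storageClassName": storage_class,
            # ReadWriteMany lets several pods mount the same volume at once.
            "accessModes": ["ReadWriteMany"],
            "resources": {"requests": {"storage": size}},
        },
    }


def pod_with_storage(name, claim_name, image="ubuntu:22.04"):
    """Build a pod that mounts the claim plus node-local scratch space."""
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "containers": [
                {
                    "name": "main",
                    "image": image,
                    "command": ["sleep", "infinity"],
                    "volumeMounts": [
                        # Hypothetical mount paths, chosen for illustration.
                        {"name": "project", "mountPath": "/project"},
                        {"name": "scratch", "mountPath": "/scratch"},
                    ],
                }
            ],
            "volumes": [
                # Persistent data: backed by the PersistentVolumeClaim.
                {
                    "name": "project",
                    "persistentVolumeClaim": {"claimName": claim_name},
                },
                # Temporary data: emptyDir is deleted together with the pod.
                {"name": "scratch", "emptyDir": {}},
            ],
        },
    }


if __name__ == "__main__":
    # Hypothetical object names; emitted as one JSON List for kubectl.
    manifests = {
        "apiVersion": "v1",
        "kind": "List",
        "items": [
            cephfs_pvc("example-project-vol"),
            pod_with_storage("example-pod", "example-project-vol"),
        ],
    }
    print(json.dumps(manifests, indent=2))
```

Because kubectl accepts JSON as well as YAML, the printed output could be piped to `kubectl apply -n <your-namespace> -f -`. Anything written under the scratch mount is lost when the pod is deleted, so results that need to survive should go to the claim-backed mount.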

AI and Machine Learning

Nautilus is well-equipped for a wide range of AI and Machine Learning workloads.
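
On a Kubernetes cluster, an AI workload typically obtains a GPU by declaring it in the container's resource limits. The sketch below builds such a pod manifest in Python. It assumes GPUs are exposed under the standard NVIDIA device plugin resource name `nvidia.com/gpu`; the pod name, image, command, and CPU and memory figures are hypothetical values for illustration, not Nautilus recommendations.

```python
import json


def gpu_pod(name, image, command, gpus=1, cpu="2", memory="8Gi"):
    """Build a pod manifest that asks for CPU, memory, and NVIDIA GPUs."""
    if gpus < 1:
        raise ValueError("gpus must be at least 1")
    resources = {"cpu": cpu, "memory": memory, "nvidia.com/gpu": str(gpus)}
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            # Run once and stop instead of restarting after completion.
            "restartPolicy": "Never",
            "containers": [
                {
                    "name": "main",
                    "image": image,
                    "command": command,
                    # GPU counts go in limits; when requests are also
                    # given, Kubernetes requires the two to be equal.
                    "resources": {
                        "requests": dict(resources),
                        "limits": dict(resources),
                    },
                }
            ],
        },
    }


if __name__ == "__main__":
    # Hypothetical example: check that the container can see a GPU.
    pod = gpu_pod(
        name="example-gpu-check",
        image="pytorch/pytorch:latest",
        command=[
            "python",
            "-c",
            "import torch; print(torch.cuda.is_available())",
        ],
    )
    print(json.dumps(pod, indent=2))
```

The JSON output can be submitted with `kubectl apply -n <your-namespace> -f -`. For longer training runs, a Job or Deployment is usually a better fit than a bare pod.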

Jupyter and Interactive Computing

Development and Version Control

Specialized Hardware and Networking

Additional Resources & Community

The NRP Nautilus Cluster, as part of the broader National Research Platform, offers a rich set of resources and a collaborative environment to advance computational research and education.