Introduction

    OCTOPUS provides Data Provenance System for HPC (DPS4H), which enables the collection and viewing of provenance information.
    This manual outlines the procedures for searching, visualizing, viewing, and editing provenance information collected by DPS4H using a browser or CLI.
     
    This manual is primarily intended for the following users:

    • General users (group members)
    • Group administrators
    • System administrators
    • CLI users

 

System Requirements and Prerequisites

    Web Browser Operating Environment (Recommended)

      • OS: Windows 11, macOS
      • Browser: Microsoft Edge (Win), Google Chrome (Win), Safari (Mac), Firefox

     

    Accounts and Roles

      The content displayed on screens and available functions vary based on the user's permissions (role).
      This system has the following roles:

      • System Administrator: Can perform operations on all provenance
      • Group Administrator: Can perform operations on managed groups and their own provenance
      • Group Member: Can perform operations on their own provenance

     

    Session Management

      The session times out after 30 minutes of inactivity. Upon resuming operations, you will be automatically redirected to an error screen and will need to log in again.

 

Usage

    When you submit a job specifying the job class "DPS" for provenance collection on the front-end server, the Tracer on the compute node will begin collecting information and executing the job. For details on job classes, please refer to this page.
     
    The Aggregator constructs provenance information once per hour. Since this construction process cannot be started concurrently, if the previous batch processing is incomplete, it will be skipped and executed during the next startup.
     
    Users can search provenance information via browser or CLI. The same search conditions can be specified in both the browser and CLI. Graph display, destination settings, and permanent link operations are performed via the browser.

 

Authentication and Login

    Login

      This section explains the steps to log in via your browser.

      • Access the login URL (https://scup.hpc.osaka-u.ac.jp/).
      • Enter your ID or email address, password, and one-time passcode. Your ID and password are the same as those for OCTOPUS.

       
      * For your first login, you must set up two-factor authentication. A QR code will be displayed during your first login. Please set this up using a two-factor authentication app like Google Authenticator.

     

    Logout

      You can log out by selecting "Logout" (ログアウト) from the user name menu in the top right corner of the screen.

     

    When Accessing via Permanent Link

      Users can share provenance information with external parties by issuing a permanent link for jobs they have executed. Anyone accessing the permanent link issued by the user can view the provenance information without authentication or authorization. When accessing via a permanent link, functions such as setting the publication destination, issuing links, or searching are disabled.

 

Searching and Viewing Provenance Information

    Search Fields

      The main search fields are as follows.

      Search Field Input Format Matching Condition Notes/Details
      Job ID Numbers or colons Exact match The format differs from the NQSV JOBID displayed on OCTOPUS.
      Please convert it using the following procedure.
      [Conversion Procedure]

      • Remove ".oct" from the Job ID
      • Convert the job start date to YYYYMMDD format
      • Enter in the format "{Job ID}:{Job start date}"

      [Example]

      • Original Job ID: 49812.oct
      • Job Start Date: 2025/11/30
      • Search Input Value: 49812:20251130

      [Important Notes]

      • Do not use slashes or hyphens in dates
      • Do not include spaces before or after the colon
      ファイル名(file name) - Partial match -
      追加情報(additem) Either item name or value Partial match Additem refers to additional information entered by users.
      ジョブスクリプト名(script) - Partial match Name of job script
      実行ユーザ(uid) Numbers Partial match Uid of executing user
      実行日時, 時刻
      (Execution time range)
      Start date/time and
      end date/time
      - -
      公開設定
      (Visibility settings filter)
      非公開 (private),
      公開指定あり (shared) or
      全体公開 (public)
      - -

     

    Important Notes

      • Date/Time Specification: Searches can be performed down to the minute. Internally, data is managed with nanosecond precision. For example, specifying an end time of "23:59" includes data up to "23:59:59.999999999".
      • Visible data varies by role. Group members can view their own non-public data, but cannot see non-public data from other companies.

     

    How to Search Provenance

      Follow these steps to search:

      • Enter your search conditions (検索条件). Click "Open Advanced Conditions" (詳細条件を開く) to input detailed conditions.
      • Click "Clear" (クリア) to reset the search conditions.
      • Click "Search" (検索) to proceed to the search results screen.

       
      Search results (検索結果) include Job information, Process information, File information, and visibility settings.

      Type Description Main Attributes
      Job Job-level information. Public range and permanent links are managed per job. id, job_id, batch_script
      Process OS process-level information. Maintains parent-child relationships and MPI type. id, pid, uid, euid, egid, path, host, args, parent, start, mpi
      File File-level information. Appears in input/output relationships. id, uid, path, inode, host
      Visibility private / shared / public visibility

       
      The following operations are available on search results.

      • Sorting: Click a table header in the search result to sort ascending or descending order. This sorting persists as a composite sort.
      • Paging: Navigate between pages in the search result. You can also change the number of rows displayed per page (choose from 20, 50, or 100).
      • Job Selection: Click a row in the search result table to view detailed provenance information related to that job.

     

    Provenance Information Display (来歴情報表示)

      Clicking a row in the search results table allows you to view the provenance information associated with that job.
       
      [Provenance Information]

        A collection of information recording the Job, Process, File, and their relationships associated with the execution of a compute job.
        Job, Process, and File are collectively called entities. The relationships between entities are called relations.

       
      [Entity]

        The fundamental elements constituting provenance information. Consists of three types: Job, Process, and File.

        Type Description Main Attributes
        Job Job-level information. Public range and permanent links are managed per job. id, job_id, batch_script
        Process OS process-level information. Maintains parent-child relationships and MPI type. id, pid, uid, euid, egid, path, host, args, parent, start, mpi
        File File-level information. Appears in input/output relationships. id, uid, path, inode, host

       
      [Relations]

        These represent relationships between entities.

        Type Source → Destination Description
        parent Process → Process Indicates a relationship from parent to child process. In the graph, an arrow points from parent to child.
        input File → Process Indicates a process used the file as input.
        output Process → File Indicates a process outputted (created/updated) the file.

       
      Provenance information can be displayed in two ways: "Graph View" (グラフ表示) and "List View" (一覧表示).
       

      Graph View (グラフ表示)

        You can view a graph representing the entities as vertices and the relations as edges when the job was executed.
         
        You can drag and move the vertices and edges of the graph. Clicking a graph vertex displays its details.
        Additionally, clicking the icon in the upper-right corner enables the following operations:

        • Zoom In/Zoom Out
        • Fit to Screen
        • Export Screen: Download the graph as a PNG image.
        • Refresh: Resets the positions of moved vertices and edges.

       

      List View (一覧表示)

        Click "List View" (一覧表示) on the graph display screen to view a list of provenance information related to the job.
         
        The list view consists of two tabs: "Entity" (エンティティ) tab and "Relation" (リレーション) tab.
        Each displays a list of the job's entities and relations, respectively.
        The following operations are available for the list view:

        • Sorting: Click a table header in the list view to sort ascending or descending order. This sorting persists as a composite sort.
        • Paging: Navigate between pages in the list view. You can also change the number of rows displayed per page (choose from 20, 50, or 100).
        • View Details: Click a row to view detailed information.

 

Editing Additional Information

    You can add, update, or delete additional information (詳細情報).
     
    [Additional Information Input Rules]

    • Additional Item Name: Required field. Cannot duplicate other additional item names.
    • Additional Item Value: Required field.

 

Visibility Settings

    Click "Visibility Settings" (公開先設定) in the upper-right corner of the provenance display (来歴情報表示) screen to configure the visibility for the job's provenance information.
    You can select and set the Visibility from the following options:

    • Private (非公開)
    • Shared (公開指定あり): You can make it visible only to specified GIDs and UIDs.
    • Public (公開)

 

Using Permanent Links

    How to Create and Use Permanent Links

      Click "Create Link" (リンク発行) in the upper-right corner of the provenance display (来歴情報表示) screen to create a permanent link. Accessing the URL displayed after creating the permanent link allows you to view provenance information without logging in. Use this when sharing with external parties. When viewing via a permanent link, you cannot adjust visibility settings, create new links, or perform searches. To delete a permanent link, click "Delete Link" (リンク削除).

     

    Important Notes

      Please delete permanent links when they are no longer needed. Also, for long-term use, please review them periodically.

 

CLI Usage (Terminal/API Integration)

    You can search, update, and delete provenance using the CLI tool in the OCTOPUS front-end.
     

    Authentication

      Authentication is required before use.
      Execute the following command in the OCTOPUS front-end to begin authentication:

      dps-cli.sh login

      After executing the command, a message like the following will appear.
      Access the displayed URL in your browser or scan the QR code to open the authentication page.

      [INFO] デバイスフロー認証を開始します...

      1. ブラウザで以下のURLにアクセス: https://auth-red.onion.osaka-u.ac.jp/realms/OCTOPUS/device?user_code=XXXX-XXXX
         (下記QRコードをスマホ等で読み取れます)
       
       
       

      ############## ######## ###### #### ## #### ##############
      ## ## ########## ## ## ## ## ## ##
      ## ###### ## ## ## #### ###### ######## ## ###### ##
      ## ###### ## #### #### ###### #### ## ## ###### ##
      ## ###### ## ######## ## ## ## #### ## ## ###### ##
      ## ## ## ## #### #### ## ######## ## ## ##
      ############## ## ## ## ## ## ## ## ## ## ## ## ##############
      #### ## ## #### ## ##########
      #### #### ## ## ########## ## ## ## ## ##

      If prompted to log in, do so as you would when viewing your browser provenance.
      Selecting "Yes" on the "Grant Access to DPS CLI Client" page completes the authentication for the CLI tool.

     

    Operating Provenance Information

      To learn how to search, update, or delete provenance information, run the following command:

      dps-cli.sh help

      Below are examples of commands for operating provenance information.

      # Search provenance information
      dps-cli.sh search --limit 20 --offset 1 --job-id "10001:20250101,10000:20250101" --filename "test.txt" --from "2025/01/01 10:00" --to "2025/01/31 23:59"
       
      # Update provenance information
      dps-cli.sh update --id "Job:10000:20250101" --entity-type 1 --add-item-name "category" --add-item-value "experiment"
       
      # Delete provenance information
      dps-cli.sh delete --id "Job:10000:20250101" --entity-type 1

 

Operational Limitations

    • If none of the job ID, euid, or egid are provided, it will be excluded from the provenance information search.
    • Impact of server downtime due to server maintenance, malfunctions, etc.
      • If the server stops during provenance information collection, provenance information will not be built.
      • The upper memory limit for provenance collection is restricted to 2GB. If it exceeds 2GB, Tracer will restart and provenance information will not be built.
    • Multiple executions of jobs using the same input file may result in the same file entity ID being registered multiple times. If information is added to a file (Item: Add Items), appended information entered by other users sharing the same file entity ID will also be displayed.
    • PRS Product (Tracer/Aggregator) Limitations
      • Parent-child relationships cannot be linked for processes generated by methods other than fork.
      • Provenance information cannot be collected for programs running within containers.
      • MPI programs other than Open MPI 4.1.1 are unverified.
      • Provenance information for MPI programs will only be built if the following conditions are met:
        • Each process is executed by the same user.
        • Each process is launched with the same command line.
      • If the same MPI program is executed multiple times within the same job with the same arguments, those MPI programs will be recognized collectively as a single MPI program.
      • If the job ID is not obtained from the environment variables of a process, it will not be recognized as an MPI program.

 

Errors and Troubleshooting

    Common Issues

      • Timeout: You will be automatically logged out after 30 minutes of inactivity, and will need to log in again.
      • Data Loading/Update Failure: Please try again after some time. If the issue persists, please contact the support representative as described below.

     

    Inquiries

      If you encounter a problem that cannot be resolved, please contact the support representative (ou-octopus2@elsd.jp.nec.com) including the following information:

      • Date and time of occurrence
      • Steps to reproduce (which screen and what action was performed)
      • Job information such as Job ID
      • Screen title
      • Displayed message (error text)