» yandex_dataproc_cluster

Get information about a Yandex Data Proc cluster. For more information, see the official documentation.

» Example Usage

data "yandex_dataproc_cluster" "foo" {
  name = "test"
}

output "service_account_id" {
  value = "${data.yandex_dataproc_cluster.foo.service_account_id}"
}

» Argument Reference

The following arguments are supported:

  • cluster_id - (Optional) The ID of the Data Proc cluster.
  • name - (Optional) The name of the Data Proc cluster.
  • folder_id - (Optional) The ID of the folder that the resource belongs to. If it is not provided, the default provider folder is used.

» Attributes Reference

In addition to the arguments listed above, the following computed attributes are exported:

  • bucket - Name of the Object Storage bucket used for Data Proc jobs.
  • cluster_config - Configuration and resources of the cluster. The structure is documented below.
  • created_at - The Data Proc cluster creation timestamp.
  • description - Description of the Data Proc cluster.
  • id - Id of the Data Proc cluster.
  • labels - A set of key/value label pairs assigned to the Data Proc cluster.
  • service_account_id - Service account used by the Data Proc agent to access resources of Yandex.Cloud.
  • zone_id - ID of the availability zone where the cluster resides.

The cluster_config block supports:

  • version_id - Version of Data Proc image.
  • hadoop - Data Proc specific options. The structure is documented below.
  • subcluster_spec - Configuration of the Data Proc subcluster. The structure is documented below.

The hadoop block supports:

  • services - List of services launched on Data Proc cluster.
  • properties - A set of key/value pairs used to configure cluster services.
  • ssh_public_keys - List of SSH public keys distributed to the hosts of the cluster.

The subcluster_spec block supports:

  • id - ID of the Data Proc subcluster.
  • name - Name of the Data Proc subcluster.
  • role - Role of the subcluster in the Data Proc cluster.
  • resources - Resources allocated to each host of the Data Proc subcluster. The structure is documented below.
  • subnet_id - The ID of the subnet, to which hosts of the subcluster belong.
  • hosts_count - Number of hosts within Data Proc subcluster.

The resources block supports:

  • resource_preset_id - The ID of the preset for computational resources available to a host. All available presets are listed in the documentation.
  • disk_size - Volume of the storage available to a host, in gigabytes.
  • disk_type_id - Type of the storage of a host.