Known issues for Cloudera Data Services on premises 1.5.5 SP2
List about the known issues and limitations, their areas of impact, and workarounds in Cloudera Data Services on premises 1.5.5 SP2.
The known issues in Cloudera Data Services on premises 1.5.5 are carried into Cloudera Data Services on premises1.5.5 SP2.
For more information, see Known Issues.
For more information on 1.5.5 SP1 known issues, see Known Issues.
Known issue identified in 1.5.5 SP2
The following are the known issues identified in 1.5.5 SP2:
- OPSX-6950 - DRS Restore fails due to ClusterIP allocation conflict
- During a DRS restore, the restore operation can fail with a
Kubernetes error indicating that a service ClusterIP is already
allocated. This occurs when the restore process attempts to recreate a service using a
ClusterIP that is currently in use by another existing service in
the cluster.
A typical error message looks like:
service "cdp-release-cert-manager-cainjector" is invalid: spec.clusterIPs: failed to allocate IP <IP_ADDRESS>: provided IP is already allocated
- OPSX-6867 - Post upgrade validation fails due to longhorn-system pods in CrashLoopBackOff
- During an upgrade to 1.5.5 SP2, some Longhorn CSI
plugin pods gets into a
terminatingstate, which causes the upgrade to fail. This problem has been observed when Longhorn is not configured to use dedicated disks, leading to instability in the storage components and preventing proper pod shutdown and restart.
- OPSX-6858 - Cloudera Embedded Container Service first run fails at install-cp step due to mke2fs failure
- During some Cloudera Embedded Container Service installations,
the first-run process fails at the
install-cpstep because certain pods remain in aCreatingstate. The underlying cause is a failure to mount the associated Longhorn volume, which leads to an error when Kubernetes tries to format the block device. The pod event shows a message as follows:
This happens when the Longhorn PVC’s block device still contains stale filesystem or partition metadata from previous use. Because the device appears in use theWarning FailedMount ... MountVolume.MountDevice failed for volume "pvc-…" rpc error: code = Internal desc = format of disk "/dev/longhorn/pvc-…" failed: … mke2fs … /dev/longhorn/pvc-… is apparently in use by the system; will not make a filesystem here!mke2fscommand cannot create a new filesystem, blocking the pod from starting.
