How does character encoding affect file migration from RHEL7 (EUC-JP) to ONTAP (UTF-8)
Applies to
- ONTAP 9
- Red Hat Enterprise Linux 7 (RHEL7) with EUC-JP encoding
- NFS (v3, v4, v4.1)
- SMB (SMB1, SMB2)
- FTP
Answer
- When migrating files from a server with
EUC-JPencoding (such as RHEL7) to an ONTAP system usingUTF-8encoding, special considerations must be taken to avoid issues such as file name corruption. - Here's how different protocols handle character encoding during migration:
- NFSv4.x and SMB (CIFS):
- These protocols inherently use Unicode, which minimizes the risk of character encoding issues during migration.
- ONTAP systems, when using NFSv4.x or SMB, automatically handle the conversion between different character encodings.
- Therefore, when migrating data via these protocols, ONTAP does not require any special configuration for character encoding.
- NFSv3:
- Unlike NFSv4.x, NFSv3 does not enforce the use of Unicode.
- When using NFSv3 to migrate data, it's crucial to ensure that the destination ONTAP system is configured to use the same character encoding as the source. However, due to potential risks of file name corruption, it's recommended to avoid using NFSv3 for migration if the source and destination have different character encodings.
- Instead, consider using NFSv4.x or SMB, which provide better support for encoding conversion.
- FTP:
- The FTP protocol does not specify character encoding, which means it's up to the client and server to ensure they agree on the encoding used.
- When transferring files via FTP, make sure both the FTP client and the ONTAP system are set to use UTF-8 to prevent any character encoding issues.
- NFSv4.x and SMB (CIFS):
Additional Information
additionalInformation_text
