One of the pages have a form that has to accept inputs in multiple languages.
The field values provided as input are stored in a crx-node for further processing.
UTF-8 encoding is being carried out to encode the form values. Even though most of the characters are getting encoded properly, we observe that certain slovenian characters are not getting encoded.
In the following sentence "Děkujeme Vám za Váš zájem o návštěvu Floridy.", all the characters except 'ě' is getting UTF-8 encoded properly.
We find the encoding is getting lost at the server level for this character and it is replaced with following :
D�?kujeme Vám za Váš zájem o návšt�?vu Floridy
For carrying out UTF-8 encoding, we are performing the following:
In the head: <meta http-equiv="Content-Type" content="text/html;charset=UTF-8">
For the form: <form method="post" enctype="multipart/form-data" accept-charset="UTF-8" >
In the Servlet: request.setCharacterEncoding("UTF-8");
But when trying to read the request, the character encoding is not getting retained.
Below is the way to reproduce this issue,
Create a form that accepts user inputs and store the values in a crx node. The input value entered should be "Děkujeme Vám za Váš zájem o návštěvu Floridy.". The value should be stored as it is in the crx node.
CQ5.5 is used.
Any help in this respect is much appreciated.
Are you facing this issue when you hit instance directly without webserver? Is the issue happening geometrixx also?
Its through web server only. Not directly.
Got the issue resolved by adding hidden input type in jsp
<input type="hidden" name="_charset_" value="UTF-8" />