Displacement of diacritics/accents from strings in python

Jesujoba ALABI 2019-07-04 04:47

I removed numbers using reg ex 'cos some the strings contain numbers but used the maketrans method from the string library to remove the punctuation marks.

import string
out = re.sub(r'[0-9\.]+', '', ins)
punct = str.maketrans({k: None for k in string.punctuation})
new_s = out.translate(punct)

ShadowRanger 2019-07-04 06:48:03

Note: You don't need to build the dict manually before passing to str.maketrans, you can just do str.maketrans('', '', string.punctuation) (the third argument is characters to be deleted in the translation table). Also note that this only handles ASCII punctuation, stuff like smart quotes (that many word processing tools substitute for ASCII single and double quotes automatically) aren't included in string.punctuation.

Related issues

How to use python cut method to create bins, accept one parameter and return appropriate bin?

Create a dictionary from a list of lists with certain criteria

selecting columns based on row value, Python, Pandas

plotting count of zeros and ones in a dataframe

BeautifulSoup find.all() web scraping returns empty

python function. output a keys list from a dictionary if the key is todays date

Best way to perform multiple amount of Pandas lookups between two DataFrames

How to get the number of columns and the width of each column in a Pandas pivot table?

Display a column when a desired value is missing while grouping in Pandas dataframe

Python hide ticks but show tick labels