20 lines
554 B
Markdown
20 lines
554 B
Markdown
|
Interviews
|
||
|
========================
|
||
|
|
||
|
|
||
|
Some of the data we process is in a mix of cs and pdf, how would you deal with processing that.
|
||
|
|
||
|
We deal with a lot of data from different sources in different spoken languages, how would you normalise this data?
|
||
|
|
||
|
|
||
|
|
||
|
The data we get can be quite raw and messy, how would you ensure that it as accurate and useable?
|
||
|
|
||
|
4) Mention what is data cleansing?
|
||
|
|
||
|
Data cleaning also referred as data cleansing, deals with identifying and removing errors and inconsistencies from data in order to enhance the quality of data.
|
||
|
|
||
|
|
||
|
|
||
|
|