Check if email is valid
Binary feature indicating whether the email has a valid format
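As a rough illustration of what such a validity check might look like, here is a minimal sketch. The function name `is_valid_email`, the regular expression, and the choice to return `None` for a missing value are all assumptions made for this example; the actual validation rules are not stated here and may be stricter or more permissive.

```python
import re
from typing import Optional

# Hypothetical, deliberately simple pattern: exactly one "@" with a
# non-empty, whitespace-free prefix and domain on either side.
_EMAIL_RE = re.compile(r"[^@\s]+@[^@\s]+")


def is_valid_email(email: Optional[str]) -> Optional[bool]:
    """Return None for a missing value, else whether the format looks valid."""
    if email is None:
        # Assumption: a missing email yields a missing (not False) result.
        return None
    return _EMAIL_RE.fullmatch(email) is not None
```

For example, `is_valid_email("jane@example.com")` passes this check, while a string without an `@` does not.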
Extract email domains
email domain
Extract email prefixes
email prefix
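The domain and prefix extractions described above amount to splitting the address at the `@` sign. The sketch below shows one way to do that; `split_email` is a hypothetical helper, and returning `None` for unsplittable input is an assumption rather than documented behaviour.

```python
from typing import Optional, Tuple


def split_email(email: Optional[str]) -> Tuple[Optional[str], Optional[str]]:
    """Return (prefix, domain), or (None, None) when the value cannot be split."""
    # Assumption: anything without exactly one "@" is treated as unsplittable.
    if email is None or email.count("@") != 1:
        return (None, None)
    prefix, domain = email.split("@")
    if not prefix or not domain:
        return (None, None)
    return (prefix, domain)
```

With this helper, the prefix is the first element of the returned pair and the domain is the second.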
Converts a sequence of Email features into a vector by extracting the domain of each email and pivoting on the top K most frequent domains of each feature. An extra column per feature counts the values that were not in the top K.
How many values to keep in the vector
If true, ignores capitalization and punctuation when grouping categories
Minimum number of times a value must occur to be retained in the pivot
Whether to keep an extra column indicating that the feature was null
Other Email features
Maximum fraction of distinct values a categorical feature can have (between 0.0 and 1.0)
The vectorized features
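To make the interaction of these parameters concrete, the following is a minimal sketch of a top-K domain pivot for a single email column. Everything named here (`pivot_domains`, `top_k`, `min_support`, `clean_text`, `track_nulls`, `max_pct_cardinality`, the `OTHER` and `NULL` column labels) is hypothetical and chosen to mirror the descriptions above. The sketch also assumes that malformed emails are treated as null and that exceeding the cardinality limit disables the pivot entirely; neither is confirmed by the text.

```python
import string
from collections import Counter
from typing import List, Optional, Sequence, Tuple

_PUNCT = str.maketrans("", "", string.punctuation)


def pivot_domains(
    emails: Sequence[Optional[str]],
    top_k: int,
    min_support: int,
    clean_text: bool,
    track_nulls: bool,
    max_pct_cardinality: float,
) -> Tuple[List[str], List[List[float]]]:
    """Return (column names, one-hot rows) for the email domains."""
    domains: List[Optional[str]] = []
    for e in emails:
        parts = e.split("@") if e is not None else []
        if len(parts) != 2 or not parts[1]:
            domains.append(None)  # assumption: malformed counts as null
        elif clean_text:
            # Assumed cleaning rule: lowercase and drop ASCII punctuation.
            domains.append(parts[1].lower().translate(_PUNCT))
        else:
            domains.append(parts[1])

    counts = Counter(d for d in domains if d is not None)
    present = sum(counts.values())
    if present and len(counts) / present > max_pct_cardinality:
        kept: List[str] = []  # assumption: too many distinct values, no pivot
    else:
        kept = [d for d, c in counts.most_common(top_k) if c >= min_support]

    columns = kept + ["OTHER"] + (["NULL"] if track_nulls else [])
    rows: List[List[float]] = []
    for d in domains:
        row = [0.0] * len(columns)
        if d is None:
            if track_nulls:
                row[-1] = 1.0
        elif d in kept:
            row[kept.index(d)] = 1.0
        else:
            row[len(kept)] = 1.0  # value outside the top K
        rows.append(row)
    return columns, rows
```

In this sketch each row holds a single email, so the "other" column is 0 or 1 per row; summing it over rows gives the number of values that fell outside the top K.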