Data Preprocessing

Discussion 1:In today’s world, data is being generated from various sources and in various formats; as the internet utilization is drastically increasing from different devices like sensors, cc cameras, laptops, workstations, tablets and iPad’s; the data available from internet is in unstructured formats and available in the form of text files, pdf files, images, videos, tweets and other formats (García, Luengo & Herrera, 2015). The collected is not normalized, clean, availability of incomplete data, de-normalized and unprocessed data. Using direct raw or unprocessed data produced false results and it is not useful for analytics.To process the data and used for the analytics, the quality of data is based on the three factors like accuracy, completeness, and consistency. Initially the data need to be accurate where the inaccuracy causes by human enters random data or chance of entering error data so incorrect and duplication of data causes inaccuracy in data processing. The other factor make sure is completeness where the incomplete data caused by data unavailability, and deleting consistent data. The third factor is consistency, to process the data in order to produce the analytical results maintaining the consistent data is one of the key factors.To perform various analysis where using processed data helps in generating various graphs and tables in decision making. The four stages that include preprocessing the data are data cleaning, data integration, data reduction and data transformation (Kamiran, & Calders, 2012). The first stage data cleaning involves identifying the missing values and eliminating noisy data. In order to remove noisy data different techniques used are binning, regression and outlier analysis. The second stage is data integration- data is being collected from various sources it is necessary to integrate the data to identify the related or correlated data. The third stage is data reduction- using different techniques data reduction helps in eliminating the duplicate data and reduces large volumes of data. Final stage is data transformation- data transformation helps in forming appropriate data in performing various algorithms and analytic techniques.ReferencesGarcía, S., Luengo, J., & Herrera, F. (2015). Data preprocessing in data mining (pp. 195-243). Cham, Switzerland: Springer International Publishing.Kamiran, F., & Calders, T. (2012). Data preprocessing techniques for classification without discrimination. Knowledge and Information Systems, 33(1), 1-33.Discussion 2:Why are the original/raw data not readily usable by analytics tasks?Raw data is usually dirty, inaccurate and misaligned. This means that it cannot be utilized in its raw format (Sharda et al., 2020). Moreover, raw data can be unstructured and overly complicated. This means that data analytics have to be performed to transform raw data into refined data (Sharda et al., 2020). Therefore, data analytics is a critical approach to transform raw data into refined data.What are the main data preprocessing steps?The process starts with data consolidation, which collects, selects and integrates data. It may involve filtering any unnecessary data before its adequately utilized. The next step data cleaning, which ensures that errors are removed from the data (Sharda et al., 2020). Moreover, in this step, data is usually imputed and eliminates any duplication of data. The third step, data transformation, involves standardization, where data is placed in a range between the smallest and largest data. Nevertheless, discretion involves the categorization of data into different classifications (Alasadi & Bhaya, 2017). In data transformation, there is the creation of different attributes of data. The last step in data preprocessing is data reduction, which ensures reduced dimension, reduced volume and balanced data (Alasadi & Bhaya, 2017). The last step ensures that there is no too much data, which may be challenging to handle.List and explain their importance in analytics.Data consolidation, the first step, is essential because it allows for data collection, selection and integration. In this step, all the unnecessary data is usually eliminated to ensure that only appropriate data is available (Losarwar, V., & Joshi, 2012). In data cleaning, data scrubbing is vital because it ensures that all the data with errors is removed. Moreover, the step ensures that there is a reduction in duplication, removing data redundancy. Data transformation enables easier categorization of data (Alasadi & Bhaya, 2017). This is important because when data is organized into categories, it can efficiently be utilized, which would be impossible when data is unstructured (Sharda et al., 2020). Data reduction enables data balancing to ensure that some of the data is not over or under-sampled. Therefore, the process of preprocessing is necessary for data analytics.

Don't use plagiarized sources. Get Your Custom Essay on
Data Preprocessing
From $8/Page
Order Essay
Full satisfaction or return your money back

Our experts will write you a top-quality paper and revise it an unlimited number of times until you're 100% satisfied - or offer a refund.

Check Prices
Get the best discount and better grades here
Pages (550 words)
$0.00
*Price with a welcome 15% discount applied.
Pro tip: If you want to save more money and pay the lowest price, you need to set a more extended deadline.
We know that being a student these days is hard. Because of this, our prices are some of the lowest on the market.

Instead, we offer perks, discounts, and free services to enhance your experience.
Sign up, place your order, and leave the rest to our professional paper writers in less than 2 minutes.
step 1
Upload assignment instructions
Fill out the order form and provide paper details. You can even attach screenshots or add additional instructions later. If something is not clear or missing, the writer will contact you for clarification.
How to get the most out of your experience with Toptutor4me
One writer throughout the entire course
If you like the writer, you can hire them again. Just copy & paste their ID on the order form ("Preferred Writer's ID" field). This way, your vocabulary will be uniform, and the writer will be aware of your needs.
The same paper from different writers
You can order essay or any other work from two different writers to choose the best one or give another version to a friend. This can be done through the add-on "Same paper from another writer."
Copy of sources used by the writer
Our college essay writers work with ScienceDirect and other databases. They can send you articles or materials used in PDF or through screenshots. Just tick the "Copy of sources" field on the order form.
Here is what our happy clients say about us.
Receiving customer feedback and reviews from legit students who have utilized our services is one of our favorite part of the job. We love seeing testimonials from happy students whom we have helped to succeed. Read through the below review page to see what our partners think.
Psychology
Thank you so much for all of your hard work, appreciate every bit of it!
Customer 452483, September 6th, 2021
Biology (and other Life Sciences)
Power Point Presentation was great, appreciate all of your hard work. Thank you!
Customer 452483, August 9th, 2021
Group Dynamics
Great job. I appreciate it.
Customer 452521, November 22nd, 2021
Healthcare Writing & Communications
Thank you so much for all of your hard work! Appreciate it all!
Customer 452483, November 14th, 2021
English 101
Perfect! Thank you so much for all of your hard work and help! Appreciate it!!
Customer 452483, August 30th, 2021
Communications
Well done! I thank you very much!
Customer 452483, July 9th, 2021
Sociology
Perfect! Appreciate all of your help! Thank you so much!
Customer 452483, November 14th, 2021
Education
Great good
Customer 452555, February 14th, 2022
Communications
Thank you for your hard work; I enjoyed reading the essay and appreciate your writing.
Customer 452483, July 18th, 2021
Nursing
Excellent. Thank you.
Customer 452487, August 26th, 2021
English 101
Thank you guys for always being there and helping me always get a 100% on my assignments!
Customer 452483, August 16th, 2021
Nursing
Excellent! Thank you.
Customer 452487, October 16th, 2021
11,595
Total Reviews
98%
Satsfaction rate
6 pages
Average paper length
40%
Customers referred by a friend
Enjoy the best prices and lifetime 15% discount
Use a coupon TOP15 and enjoy expert help with any task at the most affordable price.
Order Now Order in Chat