Included in: XLSTAT-Base, XLSTAT-Sensory, XLSTAT-Marketing, XLSTAT-Forecast, XLSTAT-Biomed, XLSTAT-Ecology, XLSTAT-Psy, XLSTAT-Quality, XLSTAT-Premium
Dataset for removing duplicates
An Excel sheet with both the data and the results can be downloaded by clicking on the button below:
Download the data
The data are fictitious and were created for this tutorial. They represent a sample of sales records from an online shop, including the order ID, the customer ID and the invoice amount.
Goal of this tutorial
Deduping is necessary when observations are mistakenly duplicated (or repeated) due to input errors. Here, we want to remove the duplicated rows from the data in order to obtain a table of unique sales records.
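To make the idea of a duplicated record concrete, here is a minimal Python sketch that is independent of XLSTAT. The rows below are hypothetical values invented for illustration (they are not the tutorial's dataset), and the function name find_duplicates is likewise an assumption. In this sketch a row counts as a duplicate only when all three fields match another row.

```python
from collections import Counter

# Hypothetical sales records: (order ID, customer ID, invoice amount).
records = [
    ("O-1001", "C-01", 25.50),
    ("O-1002", "C-02", 40.00),
    ("O-1001", "C-01", 25.50),  # exact repeat of the first row
    ("O-1003", "C-01", 25.50),  # same customer and amount, but a different order
]


def find_duplicates(rows):
    """Return a dict mapping each row that occurs more than once to its count."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}


duplicates = find_duplicates(records)
for row, n in duplicates.items():
    print(f"{row} appears {n} times")
```

Only the first record is reported, because the fourth row differs in its order ID even though the customer and amount are the same.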
Setting up a duplicate removal with XLSTAT
1. Once XLSTAT is open, select the Data Management command under the Preparing data menu.
2. The Data management dialog box appears.
3. Select columns A, B and C in the Data field. Then select the Dedupe method. Headers are included in our data selection, so we check the Variable labels option.
Click on the OK button. An XLSTAT report will be generated in a new sheet named Dedupe.
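For readers who want to cross-check the result outside Excel, the same kind of cleanup can be sketched in a few lines of Python. This is not XLSTAT's implementation: the function name dedupe, the sample rows and the choice to keep the first occurrence of each repeated row are all assumptions made for illustration.

```python
def dedupe(rows):
    """Return rows with exact duplicates removed, keeping the first occurrence."""
    seen = set()
    unique = []
    for row in rows:
        if row not in seen:  # a row is new only if every field differs from rows seen so far
            seen.add(row)
            unique.append(row)
    return unique


# Hypothetical table: the header row is kept apart from the data,
# much like checking a labels option before selecting columns.
header = ("Order ID", "Customer ID", "Invoice amount")
data = [
    ("O-1001", "C-01", 25.50),
    ("O-1002", "C-02", 40.00),
    ("O-1001", "C-01", 25.50),  # duplicated by mistake
    ("O-1002", "C-02", 40.00),  # duplicated by mistake
    ("O-1003", "C-03", 12.75),
]

unique_rows = dedupe(data)
print(header)
for row in unique_rows:
    print(row)
```

Keeping the first occurrence preserves the original row order, which makes the cleaned table easy to compare with the input.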