We are a price comparison site that lists products sold by various merchants.
I need someone to make the following software.
It should have an option to upload a CSV file containing the data.
Once the CSV is uploaded, the software should do the following:
There will be a lot of columns, but our focus is only on the name column. In the name column, one merchant may list a product as "iPhone 7 Apple" while another merchant lists the same product as "Apple iPhone 7". These are the same product sold by different merchants, but named differently. Because the names differ, the rows do not appear one after the other. I need software that detects the same product listed under different names and places those rows one after the other. The algorithm should do the same when more than two merchants, however many there are, sell the same product under differently spelled names.
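To make the requirement concrete, here is a minimal Python sketch of one possible matching approach. It is only an illustration, not a finished solution: it builds an order-insensitive key from each name (lowercase, split into words, sort the words), so "iPhone 7 Apple" and "Apple iPhone 7" end up with the same key. The function names name_key and group_names are hypothetical. This approach only catches differences in word order, letter case and punctuation; it does not catch misspellings or missing words, which would need some form of fuzzy matching on top.

```python
import re
from collections import defaultdict


def name_key(name):
    """Build an order-insensitive key: lowercase, split into
    alphanumeric tokens, sort the tokens, join with spaces."""
    tokens = re.findall(r"[a-z0-9]+", name.lower())
    return " ".join(sorted(tokens))


def group_names(names):
    """Group names that share the same key.
    Groups appear in the order their first member was seen."""
    groups = defaultdict(list)
    for name in names:
        groups[name_key(name)].append(name)
    return list(groups.values())


if __name__ == "__main__":
    sample = ["iPhone 7 Apple", "Galaxy S8 Samsung", "Apple iPhone 7"]
    for group in group_names(sample):
        print(group)
```

With the sample list above, the two iPhone names fall into one group and the Samsung name into another.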
Please look at the attached file for a better understanding as there is an example of what needs to be done.
The software should be able to handle 100,000+ rows, and it should be possible to add more data in the future. Above all, it should process the data and return the results quickly.
It needs to have 100% accuracy.
The CSV will have many columns, but the software should match on only one column (the name column), and the rest of each row should move along with it when the rows are reordered.
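As a rough illustration of the row handling, the Python sketch below reads a CSV, sorts whole rows by a key derived from the name column only, and writes the result back out, so every other column travels with its row. The header name "name", the function names name_key and align_csv, and the merchant and price columns in the usage example are assumptions made for the example; the real file may use different headers. The key used here ignores only word order, case and punctuation, so it is a starting point and not a guarantee of 100% accuracy.

```python
import csv
import io
import re


def name_key(name):
    """Order-insensitive key: lowercase alphanumeric tokens, sorted."""
    return " ".join(sorted(re.findall(r"[a-z0-9]+", name.lower())))


def align_csv(src, dst, column="name"):
    """Read CSV rows from src, sort them by the key of one column,
    and write complete rows to dst. Python's sort is stable, so rows
    with the same key keep their original relative order."""
    reader = csv.DictReader(src)
    rows = list(reader)
    rows.sort(key=lambda row: name_key(row[column]))
    writer = csv.DictWriter(dst, fieldnames=reader.fieldnames,
                            lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)


if __name__ == "__main__":
    # Hypothetical sample data; merchant and price are made-up columns.
    src = io.StringIO(
        "name,merchant,price\n"
        "iPhone 7 Apple,A,500\n"
        "Galaxy S8 Samsung,B,450\n"
        "Apple iPhone 7,C,510\n"
    )
    dst = io.StringIO()
    align_csv(src, dst)
    print(dst.getvalue())
```

For a real file, the same function can be called with file objects opened with newline="" in place of the in-memory strings. Sorting 100,000 rows this way is a single in-memory sort, which should be quick on ordinary hardware, though that has not been measured here.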