A variety of techniques exist for deriving information from diverse sources. These procedures range from simple collecting publicly available web pages using automated programs to more complex processes involving APIs and sophisticated applications. Web harvesting, while commonly used, needs to consider compliance regulations and website agreements