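The Web Scraper API supports data extraction in multiple formats, including JSON, NDJSON, JSONL, and CSV. You can specify the desired format in the request parameters.
How much does the Web Scraper API cost?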
We charge based on the number of records delivered, so you pay only for what you receive. Note, however, that unsuccessful attempts caused by incorrect user inputs are still billed: the failure stems from the input rather than from our system's performance, and resources were still consumed in processing the request. The rate per record depends on your subscription plan (starting from $0.70 per 1,000 records). Check our pricing plans or your account details for specific rates.
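As a rough illustration of the per-record billing described above, the following Python sketch estimates a charge from a record count. The function name `estimate_cost` is hypothetical, and the default rate is only the entry-level figure quoted above; your plan's actual rate may differ.

```python
from decimal import Decimal

# Assumption: the entry-level rate quoted above ($0.70 per 1,000 records).
# Replace it with the rate shown in your own account details.
RATE_PER_1000 = Decimal("0.70")


def estimate_cost(billed_records: int, rate_per_1000: Decimal = RATE_PER_1000) -> Decimal:
    """Estimate the charge in USD for a given number of billed records.

    Billed records include delivered records and, per the answer above,
    unsuccessful attempts caused by incorrect inputs.
    """
    if billed_records < 0:
        raise ValueError("billed_records must be non-negative")
    cost = Decimal(billed_records) * rate_per_1000 / Decimal(1000)
    # Round to whole cents for display purposes.
    return cost.quantize(Decimal("0.01"))


print(estimate_cost(25_000))  # 17.50
```

Decimal is used instead of float so that small per-record rates do not accumulate binary rounding error.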
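What should I do if my API token expires?
Account admin: If your API token expires, create a new API token in "Account settings".
Account user: If your API token expires, contact your account admin and ask them to issue a new token.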
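How does the Scraper API manage large-scale data extraction tasks?
The Scraper API offers high concurrency and batch processing, which makes it well suited to large-scale data extraction. This lets developers scale their scraping operations efficiently and handle very large volumes of extraction requests at high throughput.
The Web Scraper API serves a wide range of use cases, including competitive benchmarking, market trend analysis, dynamic pricing algorithms, sentiment extraction, and feeding data to machine learning pipelines. These APIs are essential for e-commerce, fintech, and social media analytics, helping developers execute data-driven strategies efficiently.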
How fast does the Web Scraper API respond?
We offer real-time support for scrapers that take URLs as inputs, with up to 20 URL inputs per call, and batch support for more than 20 inputs, regardless of the scraper type.
In real-time mode, response times vary by domain, and the data is collected fresh rather than served from a cache.
Scrapers that discover new records (e.g., "Discover by keyword," "Discover by hashtag") generally take longer and use batch support. Actual response times can be influenced by several factors, including the target URL's load time and the execution duration of user-defined Page Interactions.
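The rule above (real-time for up to 20 URL inputs, batch otherwise, and batch for discovery scrapers) can be sketched in Python as follows. The names `choose_mode` and `chunk_inputs` are hypothetical helpers for illustration, not part of the API, and splitting a large list into real-time-sized chunks is only one possible client-side approach.

```python
from typing import List

# Limit stated above for real-time calls with URL inputs.
REALTIME_MAX_INPUTS = 20


def choose_mode(inputs: List[str], is_discovery: bool = False) -> str:
    """Return "realtime" or "batch" according to the rule described above."""
    if is_discovery or len(inputs) > REALTIME_MAX_INPUTS:
        return "batch"
    return "realtime"


def chunk_inputs(inputs: List[str], size: int = REALTIME_MAX_INPUTS) -> List[List[str]]:
    """Split inputs into consecutive groups of at most `size` items."""
    return [inputs[i:i + size] for i in range(0, len(inputs), size)]


urls = [f"https://example.com/item/{n}" for n in range(45)]
print(choose_mode(urls))                      # batch
print([len(c) for c in chunk_inputs(urls)])   # [20, 20, 5]
```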
How do I cancel an API call?
You can cancel a run using the following endpoint:
curl -H "Authorization: Bearer TOKEN" -H "Content-Type: application/json" -k "https://api.brightdata.com/datasets/v3/snapshot/SNAPSHOT_ID/cancel" -X POST
Make sure the snapshot ID is the one you want to cancel.
Note: If you cancel the run, no data will be delivered to you. A snapshot can't be canceled after it has finished collecting.
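For readers calling the API from Python rather than curl, here is a minimal sketch that builds the same cancel request with the standard library. The helper name `build_cancel_request` is hypothetical; the URL, method, and headers are taken from the curl command above. The request is only constructed here, not sent.

```python
import urllib.parse
import urllib.request

API_BASE = "https://api.brightdata.com/datasets/v3"


def build_cancel_request(snapshot_id: str, token: str) -> urllib.request.Request:
    """Build (but do not send) the POST request that cancels a snapshot."""
    if not snapshot_id:
        raise ValueError("snapshot_id must not be empty")
    # Percent-encode the ID so unexpected characters cannot alter the path.
    safe_id = urllib.parse.quote(snapshot_id, safe="")
    return urllib.request.Request(
        f"{API_BASE}/snapshot/{safe_id}/cancel",
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


req = build_cancel_request("SNAPSHOT_ID", "TOKEN")
print(req.get_method(), req.full_url)
# To actually send it: urllib.request.urlopen(req)
```

Separating construction from sending makes it easy to double-check the snapshot ID before the call goes out, since a canceled run delivers no data.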
What is the difference between notify URL and webhook URL configurations?
The key difference between a notify URL and a webhook URL in API configurations lies in their purpose and usage:
Notify URL:
Typically used for asynchronous communication.
The system sends a notification to the specified URL when a task is completed or when an event occurs.
The notification is often lightweight and doesn't include detailed data but may provide a reference or status for further action (e.g., "Job completed, check logs for details").
Webhook URL:
Also used for asynchronous communication but is more data-centric.
The system pushes detailed, real-time data payloads to the specified URL when a specific event occurs.
Webhooks provide direct, actionable information without requiring the client to poll the system.
Example Use Case:
A notify URL might be used to inform you that a scraping job is finished.
A webhook URL could send the actual scraped data or detailed metadata about the completion directly to you.
How long is a snapshot available after I trigger a collection?
The snapshot is available for 30 days. During this period, you can retrieve it with its snapshot ID through the delivery API options.
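To keep track of the 30-day window on the client side, a small Python sketch like the following may help. It assumes the window is counted from the time the collection was triggered, which you record yourself; the function names are hypothetical and not part of the API.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

# Retention period stated above.
SNAPSHOT_RETENTION = timedelta(days=30)


def snapshot_expiry(triggered_at: datetime) -> datetime:
    """Return the moment the snapshot is assumed to stop being available."""
    return triggered_at + SNAPSHOT_RETENTION


def is_retrievable(triggered_at: datetime, now: Optional[datetime] = None) -> bool:
    """Check whether the snapshot is still inside the assumed 30-day window."""
    if now is None:
        now = datetime.now(timezone.utc)
    return now < snapshot_expiry(triggered_at)


triggered = datetime(2024, 1, 1, tzinfo=timezone.utc)
print(snapshot_expiry(triggered).date())  # 2024-01-31
```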