Выбор всех тегов сценария в HTML с помощью Scrapy в Django

Я хочу получить все теги сценариев из url, но есть проблема, с которой я сталкиваюсь.

Предположим, что url является: https://somewebsite.com/

Как выглядит моя функция разбора в моем классе паука (который является очень базовым):

def parse(self, response, **kwargs):
    """ Parses the response. """
    script_elements = response.css("script")
    # There are lots of script tags but most of them are missing.
    # Let's say, in website, there are 25 script tags. 
    # The code above will return 3 or 4 of them.
    parsed_result = list()
    for script_element in script_elements:
        script = script_element.extract()
        # other code blocks
    return parsed_result

Но когда я запускаю scrapy с помощью shell вот так:

scrapy shell https://somewebsite.com/

А когда я выбираю теги скрипта в shell:

response.css("script").extract()

Я получил именно то, что хотел. Все скриптовые теги в HTML есть. Почему такая разница? Я запускаю Scrapy с веб-приложением Django (не знаю, имеет ли это какое-то значение)

Вернуться на верх

Последние вопросы и ответы

Deploying Django backend and React frontend completely free — production-ready options?

PyCharm не видит импорт представлений в Django (Unresolved reference / ModuleNotFoundError)

Django on Elastic Beanstalk not detecting RDS environment variables (falls back to SQLite)

How to synchronize a locale until a defined level/page?

Why am I getting this key error when using django-formtools and django-allauth together

How to hide GraphQL exceptions in strawberry and Django?

Django add to cart function

How can I use Django Allauth and Google Identity Services (GSI) simultaneously for Google login?

how to create a custom activity logs of user [closed]

Can this query be expressed in Django?

Выбор всех тегов сценария в HTML с помощью Scrapy в Django

Последние вопросы и ответы

Рекомендуемые записи по теме