gallery-dl

Author	SHA1	Message	Date
Mike Fährmann	4b2a0a0eda	[twitter] implement 'strategy' option (#2712 ) to be able to better control what Tweets get used and returned for twitter.com/USER URLs.	2 years ago
Mike Fährmann	7b073bf9ef	Revert "[twitter] improve strategy for user URLs (#2665 )" 'user_tweets_and_replies' was a mistake	2 years ago
Mike Fährmann	d6c6c8a4a0	[twitter] improve '"replies": "self"' (#2665 ) If a username is given in the input URL, only download from replies by that user.	2 years ago
Mike Fährmann	9c8d895d19	[twitter] implement 'csrf' option (#2676 )	2 years ago
Mike Fährmann	08db8435f1	[twitter] fix pagination for conversion tweets a relic from the switch to GraphQL API	2 years ago
Mike Fährmann	1da3ccf608	[twitter] implement 'expand' option (#2665 )	2 years ago
Mike Fährmann	0add1fc090	[twitter] improve strategy for user URLs (#2665 ) - use '/with_replies' when appropriate - consider 'text-tweets' - build search query as necessary	2 years ago
thatfuckingbird	da0696e1f5	recognize vxtwitter URLs (#2621 )	2 years ago
Mike Fährmann	dcb580240d	[twitter] extract alt texts as 'description' (closes #2617 )	2 years ago
Mike Fährmann	915dba8345	[twitter] improve results for regular user URLs - continuation of `3346f58a` - use media timeline results (or tweet timeline if retweets are enabled) plus search results starting from the last tweet id of the first timeline, similar to how Twitter Media Downloader operates - the old behavior can be forced by appending '/tweets' to a user URL, like with '/media' (https://twitter.com/USER/tweets) although there should be no need to ever do that	2 years ago
Mike Fährmann	9df4e0f65b	[twitter] disable 'cards' by default	2 years ago
Mike Fährmann	3346f58a2a	[twitter] use twMediaDownloader strategy for user URLs - use media timeline + search for default user URLs like https://twitter.com/SCREEN_NAME - fetches all/most media for the type of twitter URL that most users use with gallery-dl - can be disabled by setting 'strategy' to any truthy value, like "timeline"	2 years ago
Mike Fährmann	ad5a4b1756	[twitter] fix various syndication issues - handle retweets - fix videos without dimensions in URL (`3e942a58`) - fix '"retweets": "self"' filter (#2499)	2 years ago
Mike Fährmann	3e942a58be	[twitter] improve syndication video selection (#2354 ) - ignore .m3u8 manifests - always select largest format	2 years ago
thatfuckingbird	4527a35aba	[twitter] accept fxtwitter.com URLs (#2484 )	2 years ago
Mike Fährmann	1171911dc3	[twitter] add 'syndication' option (#2354 ) to fetch age-restricted content using Twitter's syndication API	3 years ago
Mike Fährmann	2aa47e8382	[twitter] handle Tweets with "softIntervention" entries or other such things where the actual Tweet data is one level deeper than usual	3 years ago
Mike Fährmann	64bbc7969d	[twitter] warn about age-restricted Tweets (#2354 )	3 years ago
Mike Fährmann	e778be52bc	[twitter] update query hashes	3 years ago
Mike Fährmann	4385a34e05	[twitter] fix handling of 429 responses (fixes #2339 ) Twitter doesn't return a valid JSON response for 429 errors anymore.	3 years ago
Mike Fährmann	bc0e853d30	combine KeyError & IndexError to common base class LookupError	3 years ago
Mike Fährmann	0f1e7ff319	[twitter] fix extraction (#2275 )	3 years ago
Mike Fährmann	70e6e1549e	[twitter] provide fallback URLs for card images `f2e8aedd74 (commitcomment-64057751)`	3 years ago
Mike Fährmann	492436f936	[twitter] add 'warnings' option (#2258 ) disable reporting any non-fatal errors by default	3 years ago
Mike Fährmann	a5163e4c70	[twitter] restore 'logout' functionality (#1719 )	3 years ago
Mike Fährmann	d33227fc38	[twitter] restore errors for protected timelines etc (fixes #2237 )	3 years ago
Mike Fährmann	8230f31800	[twitter] update query hashes	3 years ago
Mike Fährmann	c180806cec	[twitter] fix deleted/invalid retweets (#2225 )	3 years ago
Mike Fährmann	2bf554a896	[twitter] fix several errors (#2212 , #2216 , #2225 ) - fix Tweets with deleted quotes - fix suspended Tweets without 'legacy' entry - fix unified_cards without 'type'	3 years ago
Mike Fährmann	e5242b83bf	[twitter] define directory format for events (#2109 )	3 years ago
Mike Fährmann	5ed26e1773	[twitter] fix pinned tweets (#2216 ) caused by the changes in `dffa440ede`	3 years ago
Mike Fährmann	a9f78e6527	[twitter] improve error handling - handle accounts without 'rest_id' - handle timelines with empty 'instructions'	3 years ago
Mike Fährmann	729b07c1f5	[twitter] simplify - use dict with common GraphQL variables - reduce 'variables' size with custom JSON encoder instance - centralise TwitterAPI() creation	3 years ago
Mike Fährmann	9ca8bb2dc0	[twitter] improve error handling	3 years ago
Mike Fährmann	9a221494c3	[twitter] add 'event' extractor (closes #2109 )	3 years ago
Mike Fährmann	14867dad6b	[twitter] fix unified cards from search results	3 years ago
Mike Fährmann	dffa440ede	[twitter] improve handling of deleted tweets (#2212 )	3 years ago
Mike Fährmann	54ef874ba4	[twitter] fix retweet filter (#2212 )	3 years ago
Mike Fährmann	cb43f7731b	[twitter] update to GraphQL API (#2212 ) The old REST API endpoints, which were not used by Twitter since summer 2021, are going to finally be phased out it seems, with '/2/timeline/profile/USERID.json' being the first one. Only Twitter's search doesn't have a GraphQL interface yet.	3 years ago
Mike Fährmann	f2e8aedd74	[twitter] changes to 'cards' option - change default value to 'true' - only invoke youtube-dl for cards unsupported by gallery when 'cards' is set to "ytdl" "cards": true --> only download card images "cards": "ytdl" --> download card images and use youtube_dl on otherwise unsupported cards	3 years ago
Mike Fährmann	df2f0c09bb	[twitter] support "image_carousel_website" unified cards	3 years ago
Mike Fährmann	f587458a3c	[twitter] include '4096x4096' as a default image fallback (closes #2107, closes #1881)	3 years ago
Mike Fährmann	ab8eea1a24	[twitter] fix extractor for direct image links (fixes #2030 )	3 years ago
Mike Fährmann	4377f1c284	[twitter] distinguish between fatal & nonfatal errors (#2020 ) only show a warning for nonfatal errors and do not raise a StopExtraction exception	3 years ago
Mike Fährmann	9156e90f1f	[twitter] add 'pinned' option	3 years ago
Mike Fährmann	cd66c3c415	[twitter] add 'size' option (#1881 )	3 years ago
Mike Fährmann	94143eb86c	[twitter] add 'quote_by' metadata field (#1481 ) Only present for tweets quoted by another tweet. Represents the tweet_id of said tweet quoting this one.	3 years ago
Mike Fährmann	da16eabb82	[twitter] ensure card entries have a 'url' (#1868 )	3 years ago
Mike Fährmann	0fd959a2a7	[twitter] support '/with_replies' URLs (closes #1833 )	3 years ago
Mike Fährmann	6651da27e9	[twitter] fix 'url' extraction for users without 'expanded_url' (#1532, #1787)	3 years ago
Mike Fährmann	ae78d95a5f	[twitter] fix issue when filtering quote tweets (#1792 ) When a user quotes his own Tweet and that Tweet gets filtered by '"quoted": false', it could also get filtered when it appeared later as regular Tweet.	3 years ago
Mike Fährmann	0817f468ef	[twitter] expand t.co links in user descriptions (#1532 , #1787 )	3 years ago
Mike Fährmann	7c0ae88185	[twitter] add 'url' to user objects (#1532 , #1787 )	3 years ago
Mike Fährmann	5919dc5b5a	[twitter] slightly improve '_transform_user()'	3 years ago
Mike Fährmann	6b56b3ebe1	[twitter] report API errors as generic StopExtraction exceptions prevents duplicate logging messages for nonexistent users (#1759)	3 years ago
Mike Fährmann	c866fcba48	[twitter] fix 'logout' (#1719 ) delete 'auth_token' cookie and cookies.txt path	3 years ago
Mike Fährmann	52984f7e22	[twitter] add option to log out when blocked (#1719 )	3 years ago
Mike Fährmann	e5a93e113f	[twitter] extend 'replies' option (#1254 ) Allow setting 'replies' to '"self"' to only download from self-replies.	3 years ago
Mike Fährmann	229498b8aa	[twitter] warn about suspended accounts etc (closes #1759 )	3 years ago
Mike Fährmann	414bdc95a3	[twitter] set 'retweet_id' for original retweets (#1481 )	3 years ago
Mike Fährmann	5323c1c73a	[twitter] ensure guest tokens are returned as string (#1665 )	3 years ago
Mike Fährmann	035562bd11	[twitter] remove old-style URLs from image fallback lists	3 years ago
Mike Fährmann	a751afdfb3	[twitter] change some defaults - 'retweets' option: true -> false - 'quoted' option: true -> false i.e. disable downloading tweets from other users' timelines by default - search directory: '["{category}", "Search", "{search}"]' -> '["{category}", "{user[name]}"]' i.e. change it to the same as other twitter extractors (#1308)	3 years ago
Mike Fährmann	b5affc62aa	[twitter] rename 'text-only' to 'text-tweets' (#570 )	3 years ago
Mike Fährmann	724ca61f36	[twitter] add 'text-only' option (#570 )	3 years ago
Mike Fährmann	394fbb5f56	[twitter] strip useless t.co links (#1532 ) The 'full_text' of Tweets with media content usually ends with a t.co link to itself. This commit removes those.	3 years ago
Mike Fährmann	41457dbb1b	[twitter] resolve t.co URLs in 'content' (#1532 )	3 years ago
Mike Fährmann	17b0ccb071	[twitter] add missing retweet media entities (fixes #1555 ) from the original tweets	3 years ago
Mike Fährmann	fd858eed7b	[twitter] add 'user_likes' metadata field for liked tweets i.e. the 'screen_name' of the user whose liked tweets get extracted. Ideally this would replace 'user' or at least be in the same format, but that would break backwards compatibility or be impossible/too complicated thanks to API result differences. (#1421)	4 years ago
Mike Fährmann	8d124a3766	[twitter] rename variables	4 years ago
Mike Fährmann	105f3c9666	[twitter] add extractor for direct image links (closes #1417 )	4 years ago
Mike Fährmann	ebd142e2a8	[twitter] don't use youtube-dl for cards when videos are disabled (#1416)	4 years ago
Mike Fährmann	ccfa5a8694	[twitter] better error message when logging in with 2FA (#1409 )	4 years ago
Mike Fährmann	2846235669	[twitter] allow specifying a custom format for user results (#1337)	4 years ago
Mike Fährmann	3378b39719	[twitter] implement 'users' option (#1337 )	4 years ago
Mike Fährmann	5d69e437d0	[twitter] add option to download all media from a conversation (fixes #1319)	4 years ago
Mike Fährmann	de0656941b	[twitter] add extractor for followed users (#1337 ) https://twitter.com/USER/following or https://twitter.com/id:USERID/following	4 years ago
Mike Fährmann	5542a11c46	[twitter] update GraphQL endpoints	4 years ago
Mike Fährmann	24e8e398e0	[twitter] skip login if 'auth_token' cookie is present	4 years ago
Mike Fährmann	95e5911895	[twitter] match '/i/user/ID' URLs	4 years ago
Mike Fährmann	069b113cbf	[twitter] improve and fix retry after hitting rate limit - replace recursive call with infinite loop - fix function arguments for recursive call	4 years ago
Mike Fährmann	780b6adb91	rename 'generate_csrf_token()' to just 'generate_token()' and add a 'size' argument	4 years ago
Mike Fährmann	25074aec47	[twitter] fetch media from pinned tweets (#1203 )	4 years ago
Mike Fährmann	2475176d99	[twitter] fetch tweets from 'homeConversation' entries When logged in, some entries returned by Twitter's API are so called 'homeConversation's (they would be regular tweet entries otherwise.) Those weren't picked up before and resulted in missing files compared to accessing a timeline as guest. ('/media' timelines and search results were not affected)	4 years ago
Mike Fährmann	3af9350648	[twitter] update API calls - use 'https://twitter.com/i/api' for all requests except '/guest/activate.json' - update (default) URL parameters - update GraphQL endpoints	4 years ago
Mike Fährmann	b656b829db	[twitter] fix login with username & password It is no longer possible to get an 'authenticity_token' from Twitter's JavaScript-free login form, which got disabled a few days ago. Generating a random 16 byte hex string client-side and sending that as a cookie alongside the regular login form works just as well.	4 years ago
Mike Fährmann	a00b60fbe7	[twitter] update 'x-csrf-token' header (fixes #1170 ) Twitter started using a bigger (80 instead of 16 bytes) CSRF token for logged in users, and expects those to be used as 'x-csrf-token' header when sent via 'ct0' cookie. Generating an 80 byte token ourselves doesn't work, and Twitter will still insist on using its own.	4 years ago
Mike Fährmann	63e61a0932	[twitter] update image URL format (#1145 ) use '/<name>?format=<fmt>&name=<size>' instead of the potentially deprecated '/<name>.<fmt>:<size>' but keep all of them as fallback URLs	4 years ago
Mike Fährmann	ddfb4fd07a	[twitter] use 'https://twitter.com/i/api/' for logged in users Doesn't seem to make a difference from what I can tell, i.e. downloaded files are the same, but the website does it.	4 years ago
Mike Fährmann	de0c57886d	[twitter] add 'list-members' extractor (closes #1096 )	4 years ago
Mike Fährmann	41d4968866	[twitter] add 'list' extractor (#1096 )	4 years ago
Mike Fährmann	5d10520f4c	[twitter] update GraphQL endpoint & fix width/height entries	4 years ago
Mike Fährmann	968d3e8465	remove '&' from URL patterns '/?&#' -> '/?#' and '?&#' -> '?#' According to https://www.ietf.org/rfc/rfc3986.txt, URLs are "organized hierarchically" by using "the slash ("/"), question mark ("?"), and number sign ("#") characters to delimit components"	4 years ago
Mike Fährmann	1686dc1757	[twitter] support media from Cards (#1005 , #937 ) Can be enabled with 'extractor.twitter.cards', but for now disabled by default because cards can redirect to rather large videos from YouTube or Twitch.	4 years ago
Mike Fährmann	a3ca2f6080	update fallback URL handling remove Message.Urllist and use a '_fallback' field inside a kwdict	4 years ago
Mike Fährmann	1b1cf01d0d	add a general 'generate_csrf_token()' function	4 years ago
Mike Fährmann	844502cad5	update extractor test results	4 years ago
Mike Fährmann	430b6d6e2e	[twitter] extend 'retweets' option (closes #1026 ) Setting 'retweets' to '"original"' will use metadata from the original retweeted Tweets, and not from the Retweet entry.	4 years ago
Mike Fährmann	aeb0d32333	[twitter] improve twitpic extraction (fixes #1019 ) - ignore twitpic.com/photos/… URLs - ignore empty image URLs	4 years ago
Mike Fährmann	2b8d57f0ab	[twitter] support '/intent/user?user_id=…' URLs (#980 )	4 years ago
Mike Fährmann	a3b473bd2f	[twitter] support specifying users by ID (#980 ) by using 'id:…' as their screen name, i.e. https://www.twitter.com/id:2976459548/media instead of https://twitter.com/supernaturepics/media The user ID can, for example, be obtained from the output of $ gallery-dl -j --range 1 https://twitter.com/<screen-name>	4 years ago
Mike Fährmann	8f64585ff2	[twitter] handle 429 responses without x-rate-limit-reset header	4 years ago
Mike Fährmann	2da71cb561	[twitter] raise proper exception if user doesn't exist (#891 )	4 years ago
Leonardo Taccari	86e5a05e29	[twitter] add support for nitter.net URLs in pattern (#890 ) Please note that URLs are only "translated", all requests are still done always via the Twitter API.	4 years ago
Mike Fährmann	3855d0dd3c	[twitter] add debug messages for all skipped Tweets (#867 )	4 years ago
Mike Fährmann	6e2af9a8d8	[twitter] improve error message formatting	4 years ago
Mike Fährmann	9da2bc67f8	[twitter] add option to filter media from quoted tweets (#854 )	4 years ago
Mike Fährmann	56ab5fb8f4	[twitter] improve handling of quoted tweets (#854 ) Split each "quote" into two parts: - the original tweet - the tweet that quoted the original	4 years ago
Mike Fährmann	a8c2d997e8	[twitter] treat quoted tweets like retweets (#833 ) - filter them when 'retweets' is disabled - set 'author' to the creator of the quoted tweet like it was before the rewrite	4 years ago
Mike Fährmann	aed1c63e51	[twitter] improve search results (fixes #847 ) Adding 'tweet_search_mode=live' to the query parameters is the most important part here.	4 years ago
Mike Fährmann	d81a8e6544	[twitter] update tests	4 years ago
Mike Fährmann	d39eedd9bb	[twitter] improve handling of deleted tweets (fixes #838 )	4 years ago
Mike Fährmann	dc16f73965	[twitter] move '_guest_token()' into TwitterAPI class	4 years ago
Mike Fährmann	3561d1020a	[twitter] always provide an 'author' field (#831 , #833 ) The idea was to have less metadata clutter for most Tweets where 'author' and 'user' are the same (non-retweets), and only provide a 'user' field. The original Tweet author could be gotten with {author[…]|user[…]}, but basically no one knows about that.	4 years ago
Mike Fährmann	c37a1c06c8	[twitter] add extractor for liked tweets (closes #837 ) You need to be logged in to get access to anyone's liked tweets, it seems.	4 years ago
Mike Fährmann	b94394104c	[twitter] don't download video previews (#833 ) when 'videos' is set to False	4 years ago
Mike Fährmann	036a40943a	[twitter] don't cache results of 'user_by_screen_name()' A 'keyarg=1' argument to the memcache decorator would have worked as well, but keeping the user object in memory isn't useful for the vast majority of use cases and only wastes space. (closes #817)	4 years ago
Mike Fährmann	4442dfe7b8	[twitter] add 'reply_to' metadata to replies	4 years ago
Mike Fährmann	d769bb4b80	[twitter] improve pagination	4 years ago
Mike Fährmann	5bc1097f9d	[twitter] metadata cleanup #2 - remove useless clutter by creating new tweet-data dicts instead of reusing the original Tweet objects - rename fields to how they were named before ('id_str' -> 'tweet_id', etc.) - only include 'author' if it would differ from 'user' - restore 'archive_fmt'	4 years ago
Mike Fährmann	3eed5f52d7	[twitter] small metadata cleanup - add 'date' field - remove 'entities' and 'extended_entities' - don't include 'focus_fields' from 'original_info'	4 years ago
Mike Fährmann	655c98cbef	[twitter] skip unavailable tweets	4 years ago
Mike Fährmann	2132e5461a	[twitter] restore TwitPic support	4 years ago
Mike Fährmann	bd0f21478a	[twitter] login using the mobile nojs login page	4 years ago
Mike Fährmann	a10f31dde5	[twitter] rewrite; use new interface (#740 , #806 ) Everything except logging in with username & password and TwitPic embeds should be working again. Metadata per Tweet is massively different than before (mostly raw API responses - might need some cleaning up) and the default 'archive_fmt' changed.	4 years ago
Mike Fährmann	45baa13615	update extractor test results - don't run Instagram tests on Travis anymore - replace Twitter test because timeline was made private - update Hiperdex domain to '.com' (again ...)	4 years ago
Mike Fährmann	9f638c2e01	[twitter] add 'replies' option (closes #705 )	4 years ago
Mike Fährmann	d3b3b30107	update test results	4 years ago
Mike Fährmann	3eab07739f	[twitter] ensure videos have a 'filename' This usually gets set when invoking the 'ytdl' downloader, but when that fails, the error message would use 'None' as filename.	4 years ago
Mike Fährmann	c4371a6970	[twitter] add 'reply' metadata field (#705 )	4 years ago
Mike Fährmann	d02f7c1118	improve Extractor.wait() - allow 'until' to be a datetime object - do "time calculations" with UTC timestamps - set a default 'reason'	5 years ago
Mike Fährmann	b607d0ad7f	[twitter] fix typo in 'x-twitter-auth-type' header (#625 )	5 years ago
Mike Fährmann	2d5703c493	[twitter] use a simpler data structure to store cookies in cache Use a dict with name-value pairs instead of an entire RequestsCookieJar object.	5 years ago
Mike Fährmann	32df8d06fe	[twitter] add 'bookmark' extractor (closes #625 )	5 years ago
Mike Fährmann	19ae6f3fc4	update test results - twitter: Don't test the whole kwdict, only the actual content, since the keyword hash changes whenever that user changes his display name. - khinsider: Download host changed	5 years ago
Mike Fährmann	74e684e828	[twitter] change default value for 'videos' to 'true' Every other 'videos' option defaulted to 'true', except Twitter.	5 years ago
Mike Fährmann	facc5daa6d	[twitter] force old login page layout (fixes #584 , fixes #598 )	5 years ago
Mike Fährmann	e0dd073ce0	[twitter] replace embedded tweet test the old one was deleted	5 years ago
Mike Fährmann	25d5ec4ff3	[twitter] add option to extract TwitPic embeds (#579 )	5 years ago
Alice	f498a9057f	[twitter] Fix stop before real end (#573 ) * [twitter] Fix stop before real end Fix for https://github.com/mikf/gallery-dl/issues/544. Makes sure that it really reached the end by checking that both "min_position" is null and "has_more_items" is false before stopping. * [twitter] Fix stop before real end (update)	5 years ago
Mike Fährmann	43ab9572b4	[twitter] handle API rate limits (#526 )	5 years ago
Mike Fährmann	5532e9c158	[twitter] handle quoted tweets (#526 ) … and categorize them as retweets	5 years ago
Mike Fährmann	896896a490	[twitter] fix URLs forwarded to youtube-dl (closes #540 ) Since commit `3bba763` data["user"] is an entire dict object and no longer just the user nickname …	5 years ago
Mike Fährmann	07dafad26d	[twitter] attempt to fix infinite loops (#499 ) (Hopefully this doesn't break anything else)	5 years ago
Mike Fährmann	3bba763ab9	[twitter] improve - update metadata structure - combine all user… entries into their own dict - let 'user' always specify the Timeline owner - add 'author' entry that specifies the original Tweet author - create directories per post (closes #491) - fix username issues with /i/web/ URLs	5 years ago
Mike Fährmann	5513b66eb0	[vsco] fix user profile extraction	5 years ago
Mike Fährmann	c01ff78467	[twitter] extend 'videos' option to force extraction with ytdl (closes #459)	5 years ago
Mike Fährmann	49a6b1b6c0	[twitter] extract video stream info without youtube-dl (#452 ) This should allow video downloads when logged in without 'forward-cookies' disabled and from protected tweets. youtube-dl still gets used to download HLS playlists, but the data extraction part, which doesn't work with youtube-dl at the moment, now gets handled by gallery-dl itself.	5 years ago
Mike Fährmann	9f0dbf2a72	[twitter] raise proper exception for protected Tweets	5 years ago
Mike Fährmann	2eb38810c5	[twitter] fix image extraction when logged in (#452 ) ... for individual tweets. To get a Tweet page with the old Twitter layout, an Internet Explorer User-Agent (e.g. Mozilla/5.0 (Windows NT 6.1; WOW64; Trident/7.0; rv:11.0) like Gecko) as well as a Referer header pointing to the page itself is required. The "app_shell_visited" cookie appears to be optional at the moment, but that is what a regular web browser would send.	5 years ago
Mike Fährmann	ef17d94469	update test results	5 years ago
Mike Fährmann	1c03a389df	[twitter] small improvements to search extractor - put search results in separate directories - set 'max_position' to '-1' for first request -> prevent duplicate results - add a test - flake8	5 years ago
Alice	bcddcca6db	Add search downloading to twitter.py (#448 ) Adds the functionality to download search results on twitter.com/search. Since twitter only allows downloading of up to 3,200 of a user's most recent tweets, you will be unable to download old images from users with a lot of tweets. To bypass this, you can use the twitter search to get the tweets from the sections in time you were stopped at. An example search would be "from:user since:2015-01-01 until:2016-01-01 filter:images". The URL you would use will look something like this https://twitter.com/search?f=tweets&q=from%3Asupernaturepics%20since%3A2015-01-01%20until%3A2016-01-01%20filter%3Aimages&src=typd&lang=en The _tweets_from_api function had to be changed because it would not get the next page of results using the last "data-tweet-id". It would return the same JSON but with a "min_position" string added. Using this string for the "max_position" param from the second page onwards correctly returned the next pages. This change does not interfere with how the other extractors work as far as I know. The 2 regex patterns in the extractors had to be changed to not match the search URL.	5 years ago
Mike Fährmann	66cac207ac	[twitter] match and use 'i/web' status URLs	5 years ago
Mike Fährmann	e7690ac694	[vsco] update URL pattern (closes #410 )	5 years ago
Mike Fährmann	bc0ca66c99	[twitter] small improvements - handle reply tweets (#403) - unset cookies in Tweet extractor to "force" the legacy interface	5 years ago
Mike Fährmann	23251356cb	require 'extension' data for each URL (#382 )	5 years ago
Mike Fährmann	feb98cf196	[twitter] improve 'content' formatting; add option (#338 ) - include emoticons - leave newlines intact - remove pic.twitter.com/ links at the end	5 years ago
Mike Fährmann	0151e250f5	[twitter] extract 'content' metadata (closes #333 )	5 years ago
Mike Fährmann	8de5866fd2	[twitter] replace unit test URLs https://twitter.com/PicturesEarth was deleted	5 years ago
Mike Fährmann	049e9fd6ce	[twitter] fix pagination end condition Some timelines would cause an endless loop because 'has_more_items' is always True, even if it would return the same list of tweets over and over again.	5 years ago
Mike Fährmann	dcc1592dbf	[twitter] add fallback URLs (#237 )	5 years ago
Mike Fährmann	6264a46212	use 'utcfromtimestamp()' 'fromtimestamp()' converts its results to the local timezone and causes problems when running tests on a different machine.	5 years ago
Mike Fährmann	d84e7c6861	[twitter] extract 'date' metadata (#224 )	5 years ago
Mike Fährmann	f2cf1c1d73	use 'text.extract_from()' in a few places	5 years ago
Mike Fährmann	e730fc9045	[twitter] add login support (#214 )	6 years ago
Mike Fährmann	5530871b5a	change results of text.nameext_from_url() Instead of getting a complete 'filename' from an URL and splitting that into 'name' and 'extension', the new approach gets rid of the complete version and renames 'name' to 'filename'. (Using anything other than {extension} for a filename extension doesn't really work anyway) Example: "https://example.org/path/filename.ext" before: - filename : filename.ext - name : filename - extension: ext now: - filename : filename - extension: ext	6 years ago
Mike Fährmann	4b1880fa5e	propagate 'match' to base extractor constructor	6 years ago
Mike Fährmann	6284731107	simplify extractor constants - single strings for URL patterns - tuples instead of lists for 'directory_fmt' and 'test' - single-tuple tests where applicable	6 years ago
Mike Fährmann	baad7b0fa5	[twitter] unpack API responses when logged in (closes #123 )	6 years ago
Mike Fährmann	1532d1b690	fix 'range' tests and update a few test results	6 years ago
Mike Fährmann	188876d814	implement youtube-dl downloader module URLs starting with 'ytdl:' will now be handled by youtube-dl. There is probably a lot to fix and improve, but the basic use case works. TODO: - format selection and ytdl options in general - better filename/path handling - ytdl support for "unsupported URLs" - ...	6 years ago
Mike Fährmann	f8b3b00249	[twitter] add experimental 'videos' option (#99 ) Enabling this option will detect videos in tweets and output them as "unsupported" URLs, so that these can then be downloaded with youtube-dl There are a lot of improvements to be made to the current implementation, but it works and does what it is supposed to, even if inefficient as can be ...	6 years ago
Mike Fährmann	e9dd2eff1d	[twitter] add extractor for media-tweet timelines (#96 ) For example "https://twitter.com/PicturesEarth/media". They are different from normal timelines in that they do not contain any (re)tweets from other users and feature all media the user ever posted, including responses to other tweets.	6 years ago
Mike Fährmann	9b1c39032c	[twitter] changes and improvements - rename User- to TimelineExtractor - rename 'userid' to 'user_id' to conform to the other ..._id values - adjust archive_fmt to deal with retweets - emulate browser behavior for API calls	6 years ago
Mike Fährmann	10365394d7	[twitter] add support for user-timelines (closes #96 ) also adds a 'retweets' option to filter retweeted content	6 years ago
Mike Fährmann	34873dbd90	set 'archive_fmt' values These are going to be used to create an unique id for each image.	7 years ago
Mike Fährmann	e6814aebe2	add 'extractor.*.user-agent' config option	7 years ago
Mike Fährmann	6f30cf4c64	change keyword names to valid Python identifiers This commit mostly replaces all minus-signs ('-') in keyword names with underscores ('_') to allow them to be used in filter-expressions. For example 'gallery-id' got renamed to 'gallery_id'. (It is theoretically possible to access any variable, regardless of its name, with 'locals()["NAME"]', but that seems a bit too convoluted if just 'NAME' could be enough)	7 years ago
Mike Fährmann	852e7acd31	[twitter] ignore "Promoted Tweets"	7 years ago
Mike Fährmann	c84e975dcb	[twitter] fix image extraction	8 years ago
Mike Fährmann	94e10f249a	code adjustments according to pep8 nr2	8 years ago
Mike Fährmann	4553a6392f	[whentai] add unittests	8 years ago
Mike Fährmann	3a7421a6ce	[twitter] get 'original' instead of 'large' image	8 years ago
Mike Fährmann	bf8d88499a	[twitter] add extractor	8 years ago

335 Commits (0fcd60349855c01ab38c113f946b7c954dfbb3d3)