articulo 0.1.4

One-liner (summary)

No summary - See description for more info.

Description

[![codecov](https://codecov.io/gh/mrmegatelo/articulo/graph/badge.svg?token=4UDVH5KHWP)](https://codecov.io/gh/mrmegatelo/articulo) [![Python Versions](https://img.shields.io/pypi/pyversions/articulo)](PyPI) [![PyPI - Version](https://img.shields.io/pypi/v/articulo)](Version) ![GitHub](https://img.shields.io/github/license/mrmegatelo/articulo)

# Articulo

Tiny library for extracting articles from HTML. It can extract the content of an article, as both text and HTML, along with its title.

## Usage

### Basic usage

This library is designed to be as simple as possible. To start using it, import it and instantiate it with the link you want to parse as a parameter.

The library is also designed to work lazily: it does not send any requests until you ask for a property.

```python
from articulo import Articulo

# Step 1: initialize an Articulo instance
article = Articulo('https://info.cern.ch/')

# Step 2: request article properties. All properties resolve lazily.
print(article.title)        # article title as a string
print(article.text)         # article content as a string
print(article.markup)       # article content as an HTML markup string
print(article.icon)         # link to the article icon
print(article.description)  # article meta description
print(article.preview)      # link to the article meta preview image
print(article.keywords)     # article meta keywords list
```
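
The lazy behaviour described above can be pictured with a small sketch. This is not Articulo's actual implementation: `LazyPage`, `fake_fetch`, and the `fetch` callable are hypothetical names, and `functools.cached_property` is only one common way to defer work until a property is first read.

```python
from functools import cached_property


class LazyPage:
    """Hypothetical stand-in for an object whose properties resolve lazily."""

    def __init__(self, url, fetch):
        self.url = url
        self._fetch = fetch   # callable returning HTML for a URL (assumed)
        self.fetch_count = 0  # counts how often the page was actually fetched

    @cached_property
    def html(self):
        # Runs only on first access; the result is cached afterwards.
        self.fetch_count += 1
        return self._fetch(self.url)

    @cached_property
    def title(self):
        # Naive title extraction, for illustration only.
        start = self.html.find("<title>") + len("<title>")
        end = self.html.find("</title>")
        return self.html[start:end]


def fake_fetch(url):
    # Stands in for a network request so the sketch runs offline.
    return "<html><head><title>Demo</title></head><body></body></html>"


page = LazyPage("https://info.cern.ch/", fake_fetch)
print(page.fetch_count)  # no fetch has happened yet
print(page.title)        # first access triggers the fetch
print(page.fetch_count)  # later accesses reuse the cached HTML
```

Creating the object costs nothing; the fetch happens once, on the first property read, and every later property reuses the cached result.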

### Verbose mode

If you want to see the whole process, pass `verbose=True` to the instance. This can be helpful for debugging.

```python
from articulo import Articulo

# Initialize an Articulo instance with verbose output enabled
article = Articulo('https://info.cern.ch/', verbose=True)
```

### Controlling information loss coefficient

The whole idea of parsing article content is to find the part of the document that has the highest information density. To find that part, the library uses a so-called information loss coefficient, which determines how much the text density of the document may decrease during parsing.

The default value is 0.7, which stands for a 70% decrease in information density. In most cases this works fine. If the parsing results are unsatisfactory, you can change it by passing the `threshold` parameter to the Articulo instance.

```python
from articulo import Articulo

# Initialize an Articulo instance with a custom information loss threshold
article = Articulo('https://info.cern.ch/', threshold=0.3)
```
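
To make the role of a threshold like this more concrete, here is one possible reading of the idea, written as a toy sketch. It is an assumption for illustration, not the algorithm Articulo uses: the `Node` class, the `densest_block` function, and the rule "descend into the child with the most text while the fraction of text lost stays within the threshold" are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical simplified DOM node: a tag, its own text, and children."""
    tag: str
    text: str = ""
    children: list = field(default_factory=list)

    def text_length(self):
        # Total number of characters in this node and all of its descendants.
        return len(self.text) + sum(c.text_length() for c in self.children)


def densest_block(root, threshold=0.7):
    """Descend towards the text-heaviest child until too much text would be lost."""
    current = root
    while current.children:
        total = current.text_length()
        if total == 0:
            break
        best = max(current.children, key=Node.text_length)
        loss = 1 - best.text_length() / total  # fraction of text dropped by descending
        if loss > threshold:
            break
        current = best
    return current


page = Node("body", children=[
    Node("nav", text="Home About"),
    Node("main", children=[
        Node("article", text="x" * 400),
        Node("aside", text="y" * 100),
    ]),
])

print(densest_block(page).tag)                 # default threshold of 0.7
print(densest_block(page, threshold=0.1).tag)  # stricter threshold stops earlier
```

In this toy model a lower threshold tolerates less loss per step, so the search stops at a larger enclosing block, while a higher threshold lets it narrow down further.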

### Providing headers

In some cases you need to send additional headers to fetch an article's HTML from a URL. You can provide them with the `http_headers` parameter when you create a new Articulo instance.

```python
from articulo import Articulo

# Initialize an Articulo instance with a custom user agent
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/111.0.0.0 Safari/537.36'
}
article = Articulo('https://info.cern.ch/', http_headers=headers)
```

Size

17.6 KB

Downloads

5

Status  Completed
Checksum (MD5) 8a74267b6b80f39489223fa3119c2d84
Checksum (SHA-1) 30ec06a78886fee44b026da2a592902e21d8c4be
Checksum (SHA-256) 0d9346f146df168b9e1edf34cc54121ca3478e84b5267d4dfbd00de489f2cc15
Checksum (SHA-512) d272edd02d5c4841034571c0766c00a68368c0094b1b0fc672815fa10f4bf7ff6e…
GPG Signature
GPG Fingerprint 6811684bac0b8895434e97bdd4391b8fb999e537
Storage Region  Dublin, Ireland
Type  Binary (contains binaries and binary artifacts)
Uploaded At 4 months, 3 weeks ago
Uploaded By Fetched by Cloudsmith
Slug Id articulo-014-py3-none-anywhl-n9ce
Unique Id NtsdW3vbcM7r6ynA
Version (Raw) 0.1.4
Version (Parsed)
  • Major: 0
  • Minor: 1
  • Patch: 4
  • Type: SemVer (Compat)
  extended metadata
Author Mikhail Ivanov <ya.ivanov-m2014v@yandex.ru>
Classifiers Programming Language :: Python :: 3 | Programming Language :: Python :: 3.10 | Programming Language :: Python :: 3.11
Metadata Version 2.1
Py Filetype bdist_wheel
Py Version py3
Requires Dist beautifulsoup4 (>=4.12.2,<5.0.0) | lxml (>=4.9.3,<5.0.0) | requests (>=2.31.0,<3.0.0)
Requires Python >=3.10,<4.0
File articulo-0.1.4-py3-none-any.whl (17.6 KB, 5 downloads)
Package Contents (articulo-0.1.4-py3-none-any.whl)

This package has 10 files/directories.
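
The SHA-256 checksum listed above can be used to confirm that a downloaded copy of the wheel matches this listing. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of` and `matches_listing` are made up for this example, and it assumes the wheel has been saved in the current directory under its listed filename.

```python
import hashlib
from pathlib import Path

# SHA-256 value copied from the package details above.
EXPECTED_SHA256 = "0d9346f146df168b9e1edf34cc54121ca3478e84b5267d4dfbd00de489f2cc15"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, expected=EXPECTED_SHA256):
    """Compare a file's digest with the expected hex string (case-insensitive)."""
    return sha256_of(path) == expected.lower()


if __name__ == "__main__":
    wheel = Path("articulo-0.1.4-py3-none-any.whl")  # assumed download location
    if wheel.exists():
        print("checksum OK" if matches_listing(wheel) else "checksum MISMATCH")
    else:
        print(f"{wheel} not found; download it first")
```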

Versions of articulo (relation to this version, version, downloads, uploaded by):

Newer         0.5.1    5   Fetched by Cloudsmith
Newer         0.5.0    5   Fetched by Cloudsmith
Newer         0.4.0    5   Fetched by Cloudsmith
Newer         0.3.2    5   Fetched by Cloudsmith
Newer         0.3.1    5   Fetched by Cloudsmith
Newer         0.3.0    4   Fetched by Cloudsmith
Newer         0.2.3    5   Fetched by Cloudsmith
Newer         0.2.2    5   Fetched by Cloudsmith
Newer         0.2.1    5   Fetched by Cloudsmith
Newer         0.2.0    5   Fetched by Cloudsmith
Newer         0.1.10   5   Fetched by Cloudsmith
Newer         0.1.9    5   Fetched by Cloudsmith
Newer         0.1.8    5   Fetched by Cloudsmith
Newer         0.1.7    5   Fetched by Cloudsmith
Newer         0.1.6    5   Fetched by Cloudsmith
Newer         0.1.5    5   Fetched by Cloudsmith
This version  0.1.4    5   Fetched by Cloudsmith
Older         0.1.3    5   Fetched by Cloudsmith
Older         0.1.2    5   Fetched by Cloudsmith
Older         0.1.1    5   Fetched by Cloudsmith

Last scanned

4 months, 3 weeks ago

Scan result

Clean

Vulnerability count

0

Max. severity

Unknown

You can embed a badge in another website that shows this or the latest version of this package.

To embed the badge for this specific package version, use the following:

Markdown:
[![This version of 'articulo' @ Cloudsmith](https://api.cloudsmith.com/v1/badges/version/demo-docs/awesome-repo/python/articulo/0.1.4/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/?render=true)](https://cloudsmith.io/~demo-docs/repos/awesome-repo/packages/detail/python/articulo/0.1.4/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/)

reStructuredText:
|This version of 'articulo' @ Cloudsmith|
.. |This version of 'articulo' @ Cloudsmith| image:: https://api.cloudsmith.com/v1/badges/version/demo-docs/awesome-repo/python/articulo/0.1.4/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/?render=true
   :target: https://cloudsmith.io/~demo-docs/repos/awesome-repo/packages/detail/python/articulo/0.1.4/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/

AsciiDoc:
image::https://api.cloudsmith.com/v1/badges/version/demo-docs/awesome-repo/python/articulo/0.1.4/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/?render=true[link="https://cloudsmith.io/~demo-docs/repos/awesome-repo/packages/detail/python/articulo/0.1.4/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/",title="This version of 'articulo' @ Cloudsmith"]

HTML:
<a href="https://cloudsmith.io/~demo-docs/repos/awesome-repo/packages/detail/python/articulo/0.1.4/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/"><img src="https://api.cloudsmith.com/v1/badges/version/demo-docs/awesome-repo/python/articulo/0.1.4/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/?render=true" alt="This version of 'articulo' @ Cloudsmith" /></a>

rendered as: This version of 'articulo' @ Cloudsmith

To embed the badge for the latest package version, use the following:

Markdown:
[![Latest version of 'articulo' @ Cloudsmith](https://api.cloudsmith.com/v1/badges/version/demo-docs/awesome-repo/python/articulo/latest/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/?render=true&show_latest=true)](https://cloudsmith.io/~demo-docs/repos/awesome-repo/packages/detail/python/articulo/latest/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/)

reStructuredText:
|Latest version of 'articulo' @ Cloudsmith|
.. |Latest version of 'articulo' @ Cloudsmith| image:: https://api.cloudsmith.com/v1/badges/version/demo-docs/awesome-repo/python/articulo/latest/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/?render=true&show_latest=true
   :target: https://cloudsmith.io/~demo-docs/repos/awesome-repo/packages/detail/python/articulo/latest/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/

AsciiDoc:
image::https://api.cloudsmith.com/v1/badges/version/demo-docs/awesome-repo/python/articulo/latest/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/?render=true&show_latest=true[link="https://cloudsmith.io/~demo-docs/repos/awesome-repo/packages/detail/python/articulo/latest/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/",title="Latest version of 'articulo' @ Cloudsmith"]

HTML:
<a href="https://cloudsmith.io/~demo-docs/repos/awesome-repo/packages/detail/python/articulo/latest/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/"><img src="https://api.cloudsmith.com/v1/badges/version/demo-docs/awesome-repo/python/articulo/latest/a=noarch;xf=bdist_wheel;xn=articulo;xv=py3/?render=true&show_latest=true" alt="Latest version of 'articulo' @ Cloudsmith" /></a>

rendered as: Latest version of 'articulo' @ Cloudsmith

These instructions assume you have already set up the repository (or have read its setup instructions).

To install articulo at version 0.1.4:

pip install 'articulo==0.1.4'

You can also install the latest version of this package:

pip install --upgrade 'articulo'

If necessary, you can specify the repository directly:

pip install \
  --index-url=https://dl.cloudsmith.io/public/demo-docs/awesome-repo/python/simple/ \
  articulo==0.1.4

If you've got a project requirements.txt file, you can specify this as a dependency:

--index-url=https://dl.cloudsmith.io/public/demo-docs/awesome-repo/python/simple/
articulo==0.1.4

In addition, you can use this repository as an extra index url. However, please read our documentation on this parameter before using it. For example in a requirements.txt file:

--extra-index-url=https://dl.cloudsmith.io/public/demo-docs/awesome-repo/python/simple/
articulo==0.1.4
Warning: We highly recommend using pip (or similar) rather than installing directly.
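
After installing, you may want to confirm which version ended up in your environment. The sketch below uses the standard library's `importlib.metadata`; the helper name `installed_version` is made up for this example, and the check simply reports what it finds rather than assuming the install succeeded.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version string of a package, or None if absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


found = installed_version("articulo")
if found is None:
    print("articulo is not installed in this environment")
elif found == "0.1.4":
    print("articulo 0.1.4 is installed")
else:
    print(f"articulo {found} is installed, expected 0.1.4")
```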